Abstract: This paper is part of a series of works in which we examine in detail the concept
of Transverse Momentum Dependent (TMD), or $k_\perp$, factorization, which is frequently
encountered in the literature and is widely used in the phenomenological applications of
QCD at very high energies. We address the question of what exactly factorization is, as it is
meant in different contexts and formalisms, and we compare the formalisms to each other.
We clarify some basic concepts regarding factorization and how exactly it is applied in high
energy QCD, and we make important notes on some key and fundamental points that are
often overlooked. We offer an extensive analysis of single inclusive particle production, and
we analyze the TMD gluon distribution that plays a pivotal role in high energy QCD.
1. Introduction

Parton distributions, supplemented by factorization theorems, play a crucial role in the 
understanding and exploration of QCD [1]. In formulating factorization theorems it is
desirable to make as few approximations in the kinematics as possible, so as to capture more
of the underlying dynamics. Frequently then one encounters the concept of
transverse-momentum-dependent (TMD), or $k_\perp$-dependent, parton distributions which follow from
TMD factorization ($k_\perp$-factorization). The TMD distributions are important because they
capture more of the parton kinematics than do the canonical integrated parton distributions,
the PDFs, and they therefore play an important role in the study of less inclusive
hadronic observables which are sensitive to the details of the parton kinematics [2].

In the high energy, small-x, limit of QCD even inclusive cross sections are sensitive to 
the TMD distributions, as the so-called Regge kinematics is dominated by the transverse 
components of the momenta. Large contributions arise from large rapidity separations, 
and the typical contributing momenta are slightly off-shell, the off-shellness determined by 
the transverse momentum. Much of the intuition about the TMD distributions is based on 
concepts directly borrowed from the parton model, and it is for example very frequent to 
find in the literature the assertion that the TMD parton distributions are field theoretical 
number densities, and for example that the underlying mechanism of the phenomenon of 
saturation is related to the saturation of the phase space occupation number of gluons in 
a hadron, thus implying that there is an upper limit on the number of partons per phase
space in the hadron wave function. 

While intuitive notions may be helpful in interpreting the dynamics, what is important 
is the exact formulation of TMD factorization that is a must for any proper definition of 
the relevant parton distribution, and the resulting distribution may or may not have the 
number density interpretation. In the small-x literature we find many statements regarding 
factorization, yet looking closely at these statements, we find that the necessary proofs are 
not always provided. We have moreover found different meanings attached to the word
"factorization", and we therefore take on the task of clarifying what exactly is meant
in the different formalisms. We do this in section 3, where we compare the formalisms
with each other.

We should here mention that when we do speak of factorization we shall sometimes 
use different names to distinguish different formalisms. For example, we frequently use the 
words "hard scattering factorization" with which we are referring to the basic factorization 
of QCD processes where a hard scattering is present [1,3-7]. The hard scale sets the relevant 
momentum scale by which contributions can be classified according to their power as 
being leading or suppressed. The latter classification is achieved using the power counting
arguments of [8,9]. We will go through this factorization approach in section 3.1. We note
that the hard scattering factorization is usually referred to as "collinear factorization",
while the small-x Regge type formalisms go under the name of "$k_\perp$-factorization". This
is rather misleading, however, since $k_\perp$-factorization (TMD factorization) is also a central
part of the hard scattering factorization approach, so it is important to realize that
TMD factorization is not only relevant for small-x physics. Depending on the exact final
state studied, TMD factorization is a necessary tool for QCD studies even when x is not
small. We will in section 3.3 also go through the Color Glass Condensate (CGC) [10-15] formalism,
which is based on a physical picture of classical color fields. One of our main objectives
will be to compare the picture of factorization that emerges from the CGC with the hard
scattering factorization approach. This is important and relevant for understanding much
of the phenomenology based on these formalisms that is currently being used.

In section 4 we give a detailed analysis of the validity of factorization in single inclusive
particle production at small-x. The main small-x formula, equation (4.8), or some variation
of it, has been widely used in the applications of particle production in proton-proton (pp),
proton-nucleus (pA) and nucleus-nucleus (AA) collisions (see e.g. [16-32] and references
therein). We shall examine the foundations of the formula, the arguments given for its
validity, and we shall clarify the exact pre-factor involved in the formula (as there are
variations in the literature regarding the pre-factor). Additionally we shall examine what
exactly the definition of the corresponding TMD gluon distribution is.

The standard arguments for the validity of the $k_\perp$-factorization formula are usually
based on the use of the light-cone gauge. Here, simplifications occur because the leading
gluon contributions are suppressed, and Faddeev-Popov ghosts are absent. However, severe
technical difficulties arise from the unphysical singularities that are introduced in the
light-cone gauge propagators. One issue is that these can potentially obstruct the contour
deformations that are needed for the complete proof of factorization. Additionally, for the
TMD distributions, the singularities of the gauge propagator imply rapidity divergences
starting from one loop order, and one must then consistently regularize those divergences.

While in the moderate-x region the important gluon momenta are collinear to the
hadron momentum, in the small-x region one enters the Regge kinematics where
the transverse momentum components actually dominate. If k is the gluon momentum then
$k^+ k^- \ll k_\perp^2$. In this case the gluons are also said to be in the Glauber region. In light-cone
gauge then, transversely polarized gluons are no longer power-suppressed. This complicates
the general treatment because one can then have arbitrarily many transversely polarized
gluons exchanged without power-suppression. To remove the extra gluon contributions and
establish factorization, one must then be able to perform contour deformations on the loop
momenta out of the Glauber region. It is then important that the unphysical singularities in
the gauge propagators do not block the necessary contour deformations.
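
To make the kinematic statement concrete, the following short relation may help. It is only a sketch, assuming the light-cone convention $k^2 = 2k^+k^- - k_\perp^2$ (consistent with $P^2 = 2P^+P^- = m^2$ for the target momentum used later in the text), and it shows why the virtuality of a Glauber gluon is set by its transverse momentum:

```latex
k^2 = 2 k^+ k^- - k_\perp^2 ,
\qquad
|k^+ k^-| \ll k_\perp^2
\;\;\Longrightarrow\;\;
k^2 \simeq -k_\perp^2 .
```

In this region the gluon is therefore space-like and off-shell by an amount of order $k_\perp^2$, in line with the earlier remark that the off-shellness is determined by the transverse momentum.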

In reference [33] it is shown, at least for the deep inelastic scattering of a color-singlet
gauge invariant gluon current on a hadron, that the contour deformations are possible in
low order graphs. However, in [33] specific assumptions are made on the target state that
make the application of the Ward identities simpler, at least for the low order graphs.
Going to higher order graphs, however, complications can easily arise, and a systematic
treatment is therefore needed. We will examine the application of the axial gauge to the
particle production process in sections 4.3, 4.4 and 4.5, addressing in particular the ability
to make the necessary contour deformations.

Apart from the technical details of the proof of factorization, another issue we address
here concerns the exact definition of the TMD gluon distribution that is associated with
the factorization formula, equation (4.8). The definitions found in the literature all center
around the so-called "dipole gluon distribution" that is related to a (slightly modified)
Fourier transform of the coordinate space dipole scattering amplitude, see equations (2.10)
and (4.11). In the arguments leading to the factorization formula, however, one makes use
of the axial gauge. In the axial gauge, one necessarily obtains a definition for the gluon
distribution that is an expectation value over the transverse gluon fields, $\langle A_\perp A_\perp \rangle$. This
is canonically identified, not with the dipole distribution, but with the so-called small-x
Weizsäcker-Williams (WW) distribution which is meant to represent a number density of
gluons [34-39]. The WW distribution naturally appears also in the calculation of certain
classical quantities, such as the energy density of the so-called Glasma, see for example [40].
There is therefore a potential confusion as to what exactly the gluon distribution is; this is
for example apparent in reference [41]. We discuss further the form of the gluon distribution
in section 4.6.

We should also mention here that this work is part of a larger project initiated in 
order to understand the connections and differences between the various TMD factorization 
formalisms and the TMD gluon distributions which they give rise to. Related points that 
are not covered here will therefore be discussed and addressed in two separate papers 
[42,43]. 

This paper is somewhat long, the reason being that we cover a variety of topics which 
are important for the questions regarding factorization and the correct definitions of the 
TMD gluon distribution, and we do not wish to skip important and subtle points but rather 
try to explain and illuminate them, as this is the goal of our project. We have also aimed 
at providing a coherent exposition of the various topics that appear in different formalisms
and different sets of works but are nevertheless all centered around the concepts of TMD
factorization and TMD parton distributions. We have therefore decided to present all the
material in a single paper. We believe that it will be of interest for both experimentalists 
and theoreticians working on related topics. 

The paper is organized as follows. In section 2 we analyze and explain some fundamental
aspects of unintegrated parton distributions, starting from the elementary parton
model definition. We concentrate on the two types of distributions commonly found in the
small-x literature. Section 3 contains our main discussion on factorization. In section 3.1
we provide an analysis of the hard scattering factorization approach which leads to both
collinear and TMD factorization. Then in sections 3.2 and 3.3 we analyze the formulation
of $k_\perp$-factorization in the small-x region and we compare these to the hard scattering
TMD factorization. Section 3.4 gives an account of the formalisms that combine collinear
factorization with the small-x formulas. Section 4 gives the detailed analysis of
single inclusive particle production in the small-x region as already explained above. We have
divided this section into several subsections according to the different points we cover, as
was summarized above. Finally, the last section contains a brief summary.

2. Unintegrated parton distributions 

Our aim in this section is to first recall the basic idea of parton densities. We will outline
the basic definition as given by the parton model, and then briefly discuss some of the
modifications induced by the dynamics of QCD. We also examine the validity of the intuitive
ideas borrowed from the parton model in the formulation of small-x QCD. We will
therefore here go through the commonly used "number density" and "dipole" distributions
from the small-x literature.

The concept of parton distributions dates back to the introduction of the parton model 
itself by Feynman [44, p. 135]. There, partons of a particular flavor are considered to
have a number density in the target hadron. While for the parton model calculation in
DIS it is sufficient to consider number densities in the longitudinal momentum component
x, the concept also naturally extends to a number density in both x and $k_\perp$. The intuitive
concept of a number density of partons can be formalized using light-front quantization 
and writing 



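$$ f_j(x, k_\perp) = \sum_\alpha \frac{1}{2x(2\pi)^3} \, \frac{\langle P,h| \, a^\dagger_{k,\alpha,j} \, a_{k,\alpha,j} \, |P,h\rangle}{\langle P,h|P,h\rangle} . \tag{2.1} $$

Here j and h label parton and hadron flavor, $\alpha$ is a parton helicity index, $|P,h\rangle$ is the
target state of momentum P, and $a^\dagger$ and $a$ are parton creation and annihilation operators
respectively.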

While intuitively clear, definition (2.1) above is not really correct in full QCD, and it
cannot be used in the exact form just given [1]. In the above formula, for example, the
kinematic variables x and $k_\perp$ are literally the momentum fraction and transverse momentum
of the parton probed by the electromagnetic current in DIS. Therefore the unintegrated
distribution above is indeed a simultaneous distribution of the partons in both x and $k_\perp$.
In QCD, however, several modifications do occur. The variables x and $k_\perp$ no longer
correspond to the literal momentum fractions of any single parton in the hadron state, and
additional variables must be introduced which are connected to the divergences that occur
in loop calculations (see section 2.3 below and in addition the discussions in section 3).



2.1 The gluon "number density" 

It is in the small-x literature often implied that the TMD gluon distribution indeed has 
the meaning of a phase space number density as in the above formula. Thus we often find 
the statement of a certain "number of gluons per unit phase space". In the Color Glass 
Condensate (CGC) model at least, this statement is meant in the sense of the
Weizsäcker-Williams method of virtual quanta. We recall that in electrodynamics this method replaces
the energy density of the classical electromagnetic field created by fast moving charged par- 
ticles by the equivalent field of pulse radiation. The latter is interpreted semi-classically 
as consisting of a distribution of energy quanta, that is, photons. From the average
energy density of the classical field, $\langle |E|^2 \rangle$, one can then calculate the equivalent number of
photons. This is the reason why the gluon distribution appearing in the CGC formalism 
is referred to as the Weizsäcker-Williams (WW) gluon distribution. In the CGC then, one
solves the classical Yang-Mills equations for the non-Abelian color field. The energy density
of the classical field then relates to the equivalent number density of energy quanta, in this
case identified with the gluons. In the light-cone gauge $A^+ = 0$ one defines (for a hadron
with large $P^+$)

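$$ f_{WW}(x, k_\perp) = \frac{1}{2(2\pi)^3} \sum_{i,a} \langle a_i^{a\dagger}(x^+, k) \, a_i^a(x^+, k) \rangle
= \frac{2}{(2\pi)^3} \sum_{i,a} \langle F_a^{+i}(x^+, k) \, F_a^{+i}(x^+, -k) \rangle \tag{2.2} $$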

where $k = (k^+, k_\perp)$, and $a$ and $a^\dagger$, as in (2.1), denote the parton (in this case gluon)
annihilation and creation operators in the sense of light front quantization where $x^+$ plays
the role of time. Notice that $x = k^+/P^+$ should not be confused with the time variable $x^+$.
The last identity, $\langle F^{+i} F^{+i} \rangle$, can be calculated in a classical approximation, for example
using the McLerran-Venugopalan model [34,35], from which an explicit expression can be
obtained for $f_{WW}$.

The definition of the WW distribution is thus essentially identical to the parton model
definition (2.1). One trivial difference is that, by convention, the 1/x term in (2.1) is
not included in (2.2). As a less trivial difference we also note that while in (2.1) the
quantum mechanical averaging is taken over the momentum eigenstates of the target, $|P\rangle$,
in the CGC definition (2.2) one rather specifies a classical charge density $\rho(x^-, x_\perp)$ in
the transverse and longitudinal planes, and the classical averaging is then performed with
respect to the specified profile, using a classical weight functional^1 $W[\rho]$. One is then
clearly not averaging over momentum eigenstates. The brackets are defined such that any
function, O, of the classical source $\rho$ has the average

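$$ \langle O \rangle = \int \mathcal{D}\rho \; O[\rho] \, W[\rho] . \tag{2.3} $$

This averaging is normalized to unity, so that $\langle 1 \rangle = 1$, i.e. the classical weight functional
$W[\rho]$ is such that

$$ \int \mathcal{D}\rho \; W[\rho] = 1 . \tag{2.4} $$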
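
As an illustration of what such a weight functional can look like, consider the simplest Gaussian choice, of the type used in the McLerran-Venugopalan model. This is only a sketch: the Gaussian form, the width $\mu^2(x^-)$ and the normalization constant $\mathcal{N}$ are assumptions introduced here for illustration, and nothing in the general definition (2.3) requires them.

```latex
% Illustrative Gaussian weight (assumed form, not fixed by (2.3)-(2.4))
W[\rho] = \mathcal{N} \exp\left( - \int dx^- \, d^2x_\perp \,
          \frac{\rho_a(x^-, x_\perp) \, \rho_a(x^-, x_\perp)}{2 \mu^2(x^-)} \right),
\qquad
\mathcal{N} \ \text{fixed by} \ \int \mathcal{D}\rho \; W[\rho] = 1 .

% The only non-trivial correlator of such a Gaussian is the two-point function
\langle \rho_a(x^-, x_\perp) \, \rho_b(y^-, y_\perp) \rangle
  = \delta_{ab} \, \mu^2(x^-) \, \delta(x^- - y^-) \, \delta^{(2)}(x_\perp - y_\perp) .
```

With a weight of this kind every average $\langle O \rangle$ reduces to sums over products of two-point functions, which is what makes explicit classical evaluations of the brackets possible.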

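A gauge invariant version of (2.2) can be written as (where we now expand the
$F^{+i}(x^+, k)$ in terms of $F^{+i}(x^+, x^-, x_\perp)$)

$$ f_{WW}(x, k_\perp) = \frac{2}{(2\pi)^3} \int dx^- dy^- \int d^2x_\perp \, d^2y_\perp \;
e^{i x P^+ (x^- - y^-) - i k_\perp \cdot (x_\perp - y_\perp)} \;
\langle F_a^{+i}(x) \, W_{ab}(x, y) \, F_b^{+i}(y) \rangle . \tag{2.5} $$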

Here W denotes a Wilson line in the adjoint representation needed to make the operators 
within the expectation value gauge invariant. We write down the explicit definitions of the 
Wilson lines in the following sections. 



^1 This functional should not be confused with our generic notation for Wilson lines which is also W. We
therefore always explicitly indicate the $\rho$ dependence of the classical CGC functional and write $W_y[\rho]$.



2.2 The dipole gluon distribution 

The most commonly encountered "unintegrated gluon distribution" in the small-x formalism
is actually different from the above distribution and is related to the so-called dipole
scattering amplitude which itself is specified in coordinate space. The dipole scattering
amplitude, and the associated "gluon distribution", appear as a result of the use of the
dipole formalism [45-49] which canonically is applied to DIS at small-x.

The basic object that enters any definition of the dipole "gluon distribution" is the
coordinate space dipole "scattering amplitude", $\mathcal{N}$. The standard definition of this object
in DIS, or in $\gamma^*\gamma^*$ scattering, is given by (see for example [39,50-52])

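$$ \mathcal{N}(x_\perp, y_\perp, y) = 1 - \frac{1}{N_c} \left\langle {\rm Tr}\left[ W^\dagger(x_\perp) \, W(y_\perp) \right] \right\rangle_y , \tag{2.6} $$

where we shall freely switch between the coordinates $x_\perp$ and $y_\perp$, and

$$ r_\perp = x_\perp - y_\perp , \tag{2.7} $$

$$ b_\perp = (x_\perp + y_\perp)/2 , \tag{2.8} $$

which are respectively the dipole "size" and "impact parameter" in transverse coordinate
space. In (2.6), W denotes the eikonal Wilson line given by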

$$ W(x_\perp) = P \exp\left( -i g_s \int_{-\infty}^{\infty} d\lambda \; n \cdot A^a(x_\perp + \lambda n) \, t_F^a \right) . \tag{2.9} $$

Here P denotes path ordering with respect to $\lambda$, and $t_F^a$ is the SU(3) color matrix in the
fundamental representation. The vector n is taken along the light-like direction, and the
trace in (2.6) is meant with respect to the color matrices $t_F$. The assertion of the dipole
model is that this quantity is relevant for DIS [49,53,54], $\gamma^*\gamma^*$ scattering [50,51], and also for
quark, or prompt photon production in hadron-hadron collisions (see for example [55-57]).

As for the momentum distribution referred to as the "dipole gluon distribution"
[16,20,28,58], or also very commonly as simply the "unintegrated gluon density"
[18,19,21-24,26,27], it is given by a modified Fourier transform of the dipole scattering amplitude.
Most commonly we find in the literature the definition

$$ f_{dip}(k_\perp; y) = C \int d^2b_\perp \, d^2r_\perp \; e^{-i k_\perp \cdot r_\perp} \, \nabla^2_{r_\perp} \mathcal{N}(r_\perp, b_\perp, y) , \tag{2.10} $$

where now we have used the variables $r_\perp$ and $b_\perp$ instead of $x_\perp$ and $y_\perp$. We write the
pre-factor simply as C since there does not seem to be any universally accepted value for
it, and different papers use different pre-factors. Note also that a fully gauge invariant
definition of (2.6), and therefore also of (2.10), requires that one also insert transverse
gauge links at $\pm\infty$.

Formula (2.10) is not exactly linked to the parton model definition of the unintegrated
gluon distribution in (2.1). It is therefore also distinct from the Weizsäcker-Williams
distribution, and also from the gluon distributions obtained in the TMD factorization approach
that we go through in section 3.1.5. We examine the derivation of the Wilson lines in the
definition (|1|) in [42]. 



A version of the dipole gluon distribution in the adjoint representation appears also
in single inclusive gluon production, equation (4.11), which we shall examine in detail in
section 4.

2.3 On the rapidity variable in the gluon distribution 

It is also common to denote the rapidity dependence of the dipole distribution (2.10)
by x, using $y = \ln 1/x$. We emphasize, however, that the rapidity variable in (2.10) is
conceptually different from the variable x which appears in (2.1) and (2.2). In the dipole
distribution, $y = \ln 1/x$ enters as a rapidity cut-off, either as the scale in the CGC formalism
where the functional $W_y[\rho]$ is evaluated, or as the non-zero slope of the Wilson lines in the
formalism by Balitsky [50,51]. On the other hand, in (2.1), $x = k^+/P^+$, where $k^+$ is the
momentum of the parton entering the hard scattering. Similarly, in the light-cone gauge
definition of the WW distribution (2.2) it again has the meaning of the momentum fraction
of the gluon entering the hard scattering. Of course, to avoid rapidity divergences in (2.2)
a cut-off must be inserted just as in (2.10). There must therefore be present an additional
variable, $\zeta$, which plays the same role as $y = \ln 1/x$ in (2.10). Thus we have

$$ f_{WW} = f_{WW}(x, k_\perp, \zeta) , \tag{2.11} $$

and we must generally distinguish x and $\zeta$. It is customary to choose $\zeta = x$ where for
example in DIS x is taken to be the Bjorken variable.
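
For orientation, the conversion between the two ways of labelling the cut-off is simple arithmetic. The values of x below are arbitrary illustrative choices, not taken from the text:

```latex
y = \ln\frac{1}{x} :
\qquad x = 10^{-2} \;\Rightarrow\; y \simeq 4.6 ,
\qquad x = 10^{-4} \;\Rightarrow\; y \simeq 9.2 .
```

Each additional factor of 10 in $1/x$ thus adds about 2.3 units of rapidity to the cut-off.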

One may then naturally ask why only y and $k_\perp$ appear in the definition of the dipole
distribution. The answer is that $k^+$ is actually set to 0 (this is why the Wilson line (2.9) is
integrated in $x^-$ from $-\infty$ to $+\infty$). Thus the variable x which appears in $f_{WW}$ is instead
set to 0 in $f_{dip}$. If therefore, for example, the brackets in $f_{dip}$ are evaluated fully in the
classical approximation without any effects of quantum corrections, say in the MV model,
then there is no x dependence, unlike $f_{WW}$ which has an x dependence even in the classical
computation.

3. Factorization 

As the word "factorization" is often used in the literature, and as there are many formalisms
which go under the name of "$k_\perp$-factorization", we want to examine these formalisms, to
explain the similarities and the differences among them. We believe this to be a relevant
task since it is important, especially for the experimental community, to have a clear
understanding of what exactly is meant in the different formalisms. This is also of interest for
theorists, however, and especially in the case of small-x physics where many statements are
put forward, particularly regarding $k_\perp$-factorization and unintegrated parton distributions.
We must then once and for all analyze these statements and the assertions made.

The original concept of factorization is to be found in the hard scattering factorization 
approach [3-7] where for a given process the contributing Feynman graphs are shown to 
be factorizable into different components each of which is associated with a particular 
type of momentum region. The leading momentum regions are determined by a power
counting analysis that we go through in section 3.1.2. There is a hard part specified by
the large momentum scale Q, and dominated by short distance, $d \sim 1/Q$, contributions.
The hard scattering factorization does not directly deal with the small-x region where $\sqrt{s}$
is asymptotically large, and where there may or may not be present in addition the hard
scale Q. For an up-to-date and comprehensive overview of factorization in QCD, see [1].
We go through the hard scattering factorization in section 3.1.

After going through the hard scattering factorization, we shall in section 3.2 examine 
the basic aspects of the BFKL formalism [59-61]. Here the emphasis is put on the so-called 
Multi-Regge-Kinematics (MRK), and ideas borrowed from the pre-QCD Regge theory [62-64]
play an important role. Even though the methods are rather different from those of the hard
scattering factorization, one can actually identify a structure where different factors are
associated with different momentum regions as in the hard scattering factorization (see [42] 
for further discussions). 

There is also the CCH approach [65-67] which is based on BFKL but is meant to 
build on a structure that is closely related to the hard scattering factorization since again 
emphasis is put on a hard scattering coefficient. We will here not go through CCH since 
we give a detailed analysis in [42]. In [42] we also go through in more detail the CCFM 
formalism [68-70] that is also based on the CCH approach and is meant to interpolate 
between the small-x BFKL formalism and the collinear limit at high Q encoded in the 
DGLAP evolution. 

There is then the CGC approach [10-15, 18, 22-25, 39] which uses a very different
language in terms of classical fields, $A^\mu$, and their corresponding sources, $\rho$. In this case
emphasis is put on a power counting in $g_s \rho$ where the strong coupling $g_s$ is taken as a
fixed variable which can be made as small as possible. A difference between "dilute" and
"dense" systems is emphasized, where for dilute systems $g_s \rho \ll 1$ while for dense systems
$g_s \rho \sim 1$. The structure of the factorization formula is therefore rather different from that of
the hard scattering factorization. We analyze factorization within the CGC formalism
in section 3.3. We shall then in section 3.4 analyze some formalisms where the ideas of
collinear factorization and the CGC are mixed.

We may also mention the dipole approach encountered above where the scattering
process of a parton impinging upon a target hadron is modeled via the insertion of Wilson
lines as in (2.9), where for a quark the Wilson line is taken in the fundamental representation
while for a gluon the color matrices in (2.9) are instead taken in the adjoint representation.
The dipole formalism is easily embedded into the CGC picture because the CGC formalism, 
or the MV formalism, gives an explicit way of calculating the averages of the Wilson lines 
that are present in the dipole formalism. Actually factorization is more or less asserted in 
the dipole formalism. In [42,43] we analyze the underlying structure in more detail. 

3.1 Hard scattering factorization 

We now review and explain the factorization which is applied to processes where a hard
scale is present. As we shall see, however, there is a structure which does not depend on the
existence of the hard factor. It will then be important to understand the overall structure
here, since it can also be applied to the Regge region. We will start with the simplest
case of the parton model, and then move on to the more complicated cases in QCD, and
eventually to TMD factorization which is the main interest of this paper.

Figure 1: Left: DIS in the simple parton model. Right: Factorized structure in the parton model.



3.1.1 Basic parton model 

In order to understand the basic idea of the hard scattering factorization, it is useful to first
look at the interpretation of DIS within the parton model. The advantage of the simple
parton model is that the intuitive ideas about the scattering and the structure of hadrons
can be quantified in a mathematical manner which then paves the way for an understanding
of the more complicated case of full QCD. The quantitative analysis of the model is simplified
by the understanding of the kinematics involved, and in DIS it is convenient to consider the
frame where the target hadron has momentum $P = (P^+, m^2/2P^+, 0_\perp)$, while the virtual
photon has momentum $q = (q^+, q^-, 0_\perp)$ where of course $-2q^+q^- = Q^2$. The scattering in
the parton model approximation proceeds as shown in figure 1 (left graph). The parton
which is struck by the virtual photon has momentum k. In the rest frame of the target
all the components of k are of the order of the typical hadronic scale m. A large boost in
the plus direction then brings the momentum of P into the above form, and implies that
$k^+$ is the largest component, being of order Q, while $k^-$ and $k_\perp$ are of order $m^2/Q$ and
m respectively. This corresponds to the region where the longitudinal momentum fraction
$\xi = k^+/P^+$ is not much smaller than 1.
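
The scaling of the parton momentum components quoted above follows from a one-line boost argument. As a sketch (the boost rapidity $\psi$ is a symbol introduced here purely for illustration), a boost along the plus direction rescales the light-cone components as follows:

```latex
% Boost along the plus direction with rapidity \psi (illustrative symbol)
k^+ \to e^{\psi} k^+ , \qquad k^- \to e^{-\psi} k^- , \qquad k_\perp \to k_\perp .

% In the target rest frame k^+ \sim k^- \sim |k_\perp| \sim m.
% Choosing e^{\psi} \sim Q/m then gives
k^+ \sim Q , \qquad k^- \sim \frac{m^2}{Q} , \qquad |k_\perp| \sim m .

% The product P^+ P^- = m^2/2 is boost invariant, which reproduces
P = \left( P^+ , \frac{m^2}{2P^+} , 0_\perp \right) .
```

The hierarchy $k^+ \gg |k_\perp| \gg k^-$ is what justifies keeping only the plus component of k in the hard part.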

According to the parton model one can neglect the effects of the strong interaction
during the time of the interaction with the photon, and all the effects of the long distance
strong interactions are put into the parton distribution functions. This structure is shown
in figure 1 (right graph). In the upper part which contains the hard scattering, one can set
$k = \xi P$. In particular since the minus component of P is power suppressed with respect
to the plus component, one can make the collinear approximation whereby only $k^+$ is kept
in the calculation of the hard scattering coefficient. We denote by $\hat{k} = (k^+, 0, 0_\perp)$ the
approximated momentum.

We define the DIS hadronic tensor $W^{\mu\nu}$ as



(3.1) 



X 



A factorization formula using the basic assumptions of the parton model can then be easily
obtained for $W^{\mu\nu}$. Using the general structure of the contributing graphs shown in figure
1, we can write the hadronic tensor as



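$$ W^{\mu\nu} = \sum_j \frac{e_j^2}{4\pi} \int \frac{d^4k}{(2\pi)^4} \, {\rm Tr}\left[ \gamma^\mu \, U_j(k + q) \, \gamma^\nu \, L_j(k, P) \right] , \tag{3.2} $$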



where U refers to the upper part of the diagram while L refers to the lower blob. The
trace refers to the Dirac trace. In the upper part only $k^+$ is important so we replace k by
$\hat{k}$. Then in the lower part one can replace $k^+ \to xP^+$ since $\xi = x(1 + O(m^2/Q^2))$. Thus
we get






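$$ W^{\mu\nu} = \sum_j \frac{e_j^2}{4\pi} \, {\rm Tr}\left[ \gamma^\mu \int dk^+ \, U_j(k^+, q^-, 0_\perp) \; \gamma^\nu \int \frac{dk^- \, d^2k_\perp}{(2\pi)^4} \, L_j(xP^+, k^-, k_\perp, P) \right] + \text{p.s.c.} \tag{3.3} $$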



where "p.s.c." stands for "power suppressed corrections". To finally obtain a fully factorized
structure we notice that the leading contribution from the lower part comes from the
component which is enhanced by the factor Q in the boost along the plus direction from
the hadron rest frame. Using Lorentz invariance, this leading component can be written
as $L_{\rm leading} = \gamma^- L^+ = \gamma^- \, (1/4) \, {\rm Tr}\, \gamma^+ L$. Thus the factorized structure is given by



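$$ W^{\mu\nu} = \sum_j \frac{e_j^2}{4\pi} \, {\rm Tr}\left[ \gamma^\mu \int dk^+ \, U_j(k^+, q^-, 0_\perp) \, \gamma^\nu \, \frac{\gamma^-}{2} \right]
\times {\rm Tr}\left[ \int \frac{dk^- \, d^2k_\perp}{(2\pi)^4} \, \frac{1}{2} \gamma^+ L_j(xP^+, k^-, k_\perp, P) \right] + \text{p.s.c.} \tag{3.4} $$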



The factor in the second row defines the unpolarized integrated quark distribution in the
parton model and it can be shown to be equivalent to (2.1). The unintegrated density is
obtained simply by undoing the $k_\perp$ integral. Thus

$$ f_j(\xi) = \int d^2k_\perp \, f_j(\xi, k_\perp) \tag{3.5} $$

in the parton model. Note that the integral is over all $k_\perp$. Actually, as we review in detail
in [42], much of the literature on the TMD gluon distribution in small-x physics uses very
much the same ideas as above. We shall also see in section 4 that very similar arguments
are used in the treatment of single inclusive gluon production in small-x QCD.







Figure 2: Left: Reduced graphs for SIDIS where a hadron with momentum $p_B$ is detected. Right:
Reduced graphs for the Drell-Yan process of lepton pair production in hadron-hadron scattering.



3.1.2 On the leading momentum regions in field theory 

In trying to simplify generic graphs in a field theory, so as to extract a factorized form, it 
is important to systematically classify the structure of the leading contributions. In each 
graph at any given order in perturbation theory there may be many loop momenta that 
give rise to a rather complicated manifold of momentum regions. It turns out, however, 
that there is a correspondence between divergences in massless theories and the leading 
configurations in high-energy processes [8,9]. 

These leading regions are non-UV regions that are important when the hard scale Q
gets large. The UV region for momenta above Q of course gives divergent contributions but
these contributions are handled by renormalization which effectively cuts off the integrals
above the renormalization scale $\mu$ that conveniently may be taken as Q.

If one considers the complex momentum plane, then as Q — t- oo, many of the momen- 
tum integrations can be deformed away from the propagator singularities, and those give 
therefore vanishing contributions at asymptotic Q. There may, however, be contributions 
which cannot be deformed away from the propagator poles. These contributions arise from 
surfaces in loop momentum space which are called "pinch-singular surfaces" (PSSs). The 
PSSs therefore give important contributions which must be taken into account. To deter- 
mine the strengths of the different PSSs a power counting analysis is employed. Via the 
power counting one also can see the appropriate approximations to be made in the different 
momentum regions, and this is highly relevant for factorization. 

The interesting regions where there might be large contributions to the graphs for any given process are thus regions where a given loop momentum $k$ has small virtuality, $|k^2| \ll Q^2$. Consider semi-inclusive DIS where a hadron of momentum $p_B$ is produced away from the target, i.e. the large component of $p_B$ is its minus component. The target hadron has momentum $p_A$, which is large in the plus direction.

We show in figure 2 (left graph) a so-called "reduced graph" for the important PSSs. In obtaining a reduced graph from the full Feynman graph one contracts to points all the lines whose denominators are not pinched. This follows from the observation that those lines in the limit $Q^2 \to \infty$ carry much larger momentum than the pinched lines, and therefore in a space-time picture they would reduce to points. The regions $H$, $C_A$, $C_B$, $S$ denote the different momentum regions where the momenta are large and of order $Q$ (for $H$), collinear to $p_A$ (for $C_A$), collinear to $p_B$ (for $C_B$), and small of order $m$ (for $S$). In the asymptotic limit, $p_A$ and $p_B$ become exactly light-like, and the exact PSSs correspond to these limits where the virtuality vanishes. Of course in the realistic (non-asymptotic) case the momenta are not exactly light-like, so the exact PSSs form a sort of skeleton of the corresponding region. For example, the PSS for $C_A$ is the skeleton where the given momentum $k$ is exactly parallel to the light-like limit of $p_A$, while the whole region $C_A$ also contains momenta which are approximately collinear to $p_A$. The soft PSS corresponds to the exact limit of $S$ where all momentum components of $k$ are 0. Thus in general, momenta belonging to $S$ have all their components small (no component is enhanced by any factor of $Q$, and they stay fixed as $Q \to \infty$). The soft lines can therefore connect to any other region. If $k_S$ is a soft line and is added to, say, $k_A$ which is in $C_A$, then $k_S + k_A$ still belongs to $C_A$. We notice, however, that lines in $C_B$ and $C_A$ cannot be directly added to each other, because adding two light-like momenta in opposite directions gives a non-light-like momentum far off shell, and such a line does not belong to either of the two regions (it actually belongs to the hard region $H$). The collinear lines can, however, be added to the hard part, since the result is again a hard momentum. Thus one finds the connections between the regions as in figure 2. We also show in figure 2 (right graph) the Drell-Yan lepton pair production, where again there are two collinear regions associated with the incoming momenta $p_A$ and $p_B$; in addition there is the hard part where all momenta are of order $Q$, and there is again the soft graph connecting possibly to any of the other regions.

In a collinear pinch, say collinear to the $+$ direction, the typical scales for the momenta are $k^+ \sim Q$, $k^- \sim m^2/Q$ and $k_\perp \sim m$. In the soft pinch, on the other hand, all components satisfy $k^\mu \sim m$, while in the hard region the virtuality is large, $|k^2| \sim Q^2$. There can also be several collinear regions $C_j$ in a given process. For example in DIS we can have several jets emerging from the hard scattering, each defining its own collinear region. Notice also that a single Feynman graph can have multiple leading PSSs. This is so because for any given momentum line $k$ in the original graph, we have the possibility that $k$ is in any of the allowed regions for that graph.
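
As a quick consistency check of the scalings quoted above, the virtuality in each region can be evaluated directly. The sketch below assumes the light-cone convention $k^2 = 2k^+k^- - k_\perp^2$, which is not stated explicitly in this section and is adopted here only for illustration.

```latex
% Assumed convention (not fixed in the text): k^2 = 2 k^+ k^- - k_\perp^2
\begin{align*}
\text{collinear to } +:\quad & k^2 = 2k^+k^- - k_\perp^2
   \sim 2\,Q\,\frac{m^2}{Q} - m^2 = \mathcal{O}(m^2) \ll Q^2, \\
\text{soft}:\quad & k^2 \sim 2\,m\cdot m - m^2 = \mathcal{O}(m^2) \ll Q^2, \\
\text{hard}:\quad & |k^2| \sim Q^2 .
\end{align*}
```

Both the collinear and the soft scaling give small virtuality, which is why both can sit on a PSS; they differ only in how the individual components scale with $Q$.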

Consider now in QCD gluons exchanged between the different regions. Let us assume we have a collinear-to-$A$ gluon $k$ exchanged between the hard part $H$ and $C_A$. We then have a contribution of the type

$$H^\mu N_{\mu\nu}(k)\, C^\nu. \qquad (3.6)$$

Since $C_A$ contains momenta which are large in the $+$ direction, the contribution proportional to $C^+$ is boosted by a factor $Q$, and we see that the leading contribution satisfies

$$H^\mu N_{\mu\nu}(k)\, C^\nu \approx H^- N^{+-}(k)\, C^+. \qquad (3.7)$$

Figure 3: A two loop contribution to the Sudakov form factor.

Similar relations hold for gluons exchanged between $H$ and $C_B$. If, however, a gluon is exchanged between $H$ and the soft region $S$, there is no large boost factor associated with $S$. In fact the $H$-to-$S$ couplings give power suppressed corrections, and therefore the leading power contribution does not contain any lines attaching $H$ to $S$ (see below). As a simple example consider figure 3, where a time-like photon $q$ produces an exclusive pair of an anti-quark with large minus momentum $p_B$ and a quark with large plus momentum $p_A$ (this is a two-loop contribution to the Sudakov form factor). In the Feynman graph shown in figure 3, one possibility is that the gluon $k_1$ is collinear to $p_A$, while $k_2$ is soft. It is then easily seen that $p_A - k_1 - k_2$ and $p_A - k_2$ are collinear to $p_A$, while $p_B + k_1$ and $p_B + k_1 + k_2$ are hard lines (since their virtualities are of order $Q^2$). The reduced graph for this Feynman graph is shown in figure 4 (left graph). The contribution is proportional to



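$$g^4\,\bar u(p_A)\,\gamma^\mu\,\frac{\slashed{p}_A - \slashed{k}_2}{(p_A-k_2)^2+i\epsilon}\,\gamma^\nu\,\frac{\slashed{p}_A-\slashed{k}_1-\slashed{k}_2}{(p_A-k_1-k_2)^2+i\epsilon}\,\frac{N_{\nu\nu'}(k_1)}{k_1^2+i\epsilon}\,\gamma^\rho$$
$$\times\,\frac{\slashed{p}_B+\slashed{k}_1+\slashed{k}_2}{(p_B+k_1+k_2)^2+i\epsilon}\,\gamma^{\mu'}\,\frac{\slashed{p}_B+\slashed{k}_1}{(p_B+k_1)^2+i\epsilon}\,\gamma^{\nu'}\,v(p_B)\,\frac{N_{\mu\mu'}(k_2)}{k_2^2+i\epsilon}. \qquad (3.8)$$

To pick up the leading contributions we project out the $+$ component inside the $C_A$ part (which consists of the factors to the left of $\gamma^\rho$). This part can then be written as

$$\gamma^+\,\frac{2p_A^+}{-2p_A^+k_2^- + i\epsilon}\;\frac{\slashed{p}_A - \slashed{k}_1}{-2(p_A^+-k_1^+)k_2^- + i\epsilon}\;\frac{N^{-+}(k_1)}{k_1^2+i\epsilon} \sim \frac{Q}{Q\lambda_S}\,\frac{Q}{Q\lambda_S}\,\frac{1}{\lambda_A^2}. \qquad (3.9)$$

Here we have introduced typical momentum scales for the collinear and soft regions, $\lambda_A$ and $\lambda_S$ respectively, such that for any collinear-to-$A$ ($C_A$) momentum $k_A$ we have $k_A^2 \sim \lambda_A^2$, while for the soft momentum $k_S$ we have $k_S^\mu \sim \lambda_S$. Notice that since $k_A^+ \sim Q$, this means that $k_A^- \sim \lambda_A^2/Q$. The soft region in (3.8) simply consists of the soft propagator $1/k_S^2 \sim 1/\lambda_S^2$ and the momentum integral $\int d^4k_S \sim \int d\lambda_S\,\lambda_S^3$. The collinear-to-$B$ region, $C_B$, is elementary, while the hard region power counts as

$$\frac{\slashed{p}_B}{2p_B^-k_1^+ + i\epsilon}\,\gamma^-\,\frac{\slashed{p}_B}{2p_B^-k_1^+ + i\epsilon}\,\gamma^- \sim \frac{Q}{Q^2}\,\frac{Q}{Q^2}. \qquad (3.10)$$

The PSSs then give

$$\int d\lambda_A\,\lambda_A^3\int d\lambda_S\,\lambda_S^3\;\frac{1}{\lambda_S^2}\,\frac{1}{\lambda_A^2}\,\frac{1}{\lambda_S^2}\,\frac{Q}{Q^2}\,\frac{Q}{Q^2} = \int^{Q}\frac{d\lambda_A}{\lambda_A}\left(\frac{\lambda_A}{Q}\right)^2\int^{Q}\frac{d\lambda_S}{\lambda_S}. \qquad (3.11)$$

Figure 4: Examples of reduced graphs for the two loop Sudakov form factor.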



The complete result is given by multiplying (3.11) with the LO graph.
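
The size of the collinear integral in (3.11) can be made explicit by evaluating it with and without an upper cut. In this sketch $\Lambda$ is a hypothetical cut-off with $\Lambda \ll Q$, introduced only to separate the genuinely collinear range from the upper end point; it is not a quantity defined in the text.

```latex
% \Lambda is an illustrative (assumed) separator between the collinear range and the hard end point.
\begin{align*}
\int_0^{Q}\frac{d\lambda_A}{\lambda_A}\left(\frac{\lambda_A}{Q}\right)^{2}
   &= \frac{1}{Q^2}\int_0^{Q} d\lambda_A\,\lambda_A = \frac12, \\
\int_0^{\Lambda}\frac{d\lambda_A}{\lambda_A}\left(\frac{\lambda_A}{Q}\right)^{2}
   &= \frac{\Lambda^2}{2Q^2} \ll 1 \qquad (\Lambda \ll Q).
\end{align*}
```

The order-one value comes entirely from $\lambda_A \sim Q$, where the momentum is hard rather than collinear; restricted to the truly collinear range the contribution is suppressed by $\Lambda^2/Q^2$.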

In figure 4 (right graph) we show the case where both $k_1$ and $k_2$ are soft gluons. Here, the hard part is elementary while the soft part now contains both gluon propagators. It is easy to see that we get in this case



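$$\int d\lambda_{S,1}\,\lambda_{S,1}^3\int d\lambda_{S,2}\,\lambda_{S,2}^3\;\frac{Q^2}{(Q\lambda_{S,1})^2}\,\frac{Q^2}{(Q\lambda_{S,2})^2}\,\frac{1}{\lambda_{S,1}^2}\,\frac{1}{\lambda_{S,2}^2} = \int^{Q}\frac{d\lambda_{S,1}}{\lambda_{S,1}}\int^{Q}\frac{d\lambda_{S,2}}{\lambda_{S,2}}. \qquad (3.12)$$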



The contribution from the PSS (3.12), as we see, has no suppression compared to the LO graph, while (3.11) has a power suppression. The power suppression comes from the coupling of the soft part to the hard part.^

For a given amplitude, cross section or structure function to be analyzed we denote the leading power obtained by dimensional analysis as $Q^p$, where $p = 4 - E_l$ with $E_l$ counting the number of external lines. For the Sudakov form factor in figure 3, $E_l = 3$, so the lowest order contribution grows as $Q$. For DIS, $E_l = 4$ and the leading power is $Q^0$. For a given PSS, we then generally have integrals of the form

$$Q^{p_1}\int^{Q}\frac{d\lambda}{\lambda}\,\lambda^{p_2}, \qquad (3.13)$$

where $p_1$ and $p_2$ are different powers.

Making use of dimensional analysis and Lorentz invariance, one then finds in QCD the following results [1]: For a collinear region $C$, every line joining $C$ to $H$ gives a power $\lambda/Q$, except for longitudinally polarized gluons, carrying polarization $N^{+-}$, for which there is no suppression. For the soft region, every gluon coupling $S$ to $H$ gives a factor $\lambda/Q$ (as in the example of (3.11)) while every fermion gives $(\lambda/Q)^{3/2}$. Every fermion coupling $S$ to $C$ gives a factor $(\lambda/Q)^{1/2}$. Thus all couplings between $S$ and other regions are suppressed, except for longitudinally polarized gluons between $S$ and $C$, for which there is no suppression.



^It may seem in (3.11) that performing the $\lambda_A$ integral gives a contribution of order unity, since we integrate all the way up to $Q$. However, the integral is completely dominated by the upper limit, where the momentum is no longer collinear-to-$A$ but is instead hard. In the definition of the hard region there will be a subtraction of the smaller PSSs, for example $C_A$. That subtraction will cancel the dominant contribution of the integral and ensure that (3.11) is truly power-suppressed.

Figure 5: Generic contribution to inclusive DIS in simplified case.




Figure 6: Generic contribution to inclusive DIS. 



There is thus no penalty for coupling $C$ and $H$, and $S$ and $C$, via longitudinally polarized gluons. For more details, see [1,8,9]. In the cases where there is no suppression, the integrals (3.13) usually produce logarithms $\ln Q^2/m^2$ that accompany the leading power (this is due to the renormalizable nature of QCD, in which the coupling is dimensionless), as for example in (3.12).



3.1.3 Factorization in simple theory 

The results above show that in QCD one has to take into account arbitrarily many gluon exchanges, of longitudinal polarizations, between the different regions (except for $S$-to-$H$ couplings, which are always power suppressed regardless of polarization). The proof of factorization is then more complicated than in the simple parton model in figure 1, where gauge bosons are not present. Let us first, however, study a simplified situation by using the results from the power counting. This example will be illustrative for understanding the small-x calculations presented later.

In figure 5 we show an example of inclusive DIS where arbitrarily many gluons are exchanged between the lower part $L$, which is collinear to the target hadron $P$, and the upper part $U$, which contains the hard scattering. Of course where the final state cut goes through $U$, the cut lines are necessarily on-shell, but the bubble will still contain internal lines that are far off-shell. In a more complete picture one must consider instead the class of graphs shown in figure 6. It can, however, be shown in the inclusive case by a sum-over-cuts argument that the momenta in the collinear region can be deformed out to the region where they are far off-shell, effectively reducing the leading graphs to that shown in figure 5. We thus treat the upper part of the diagram as the hard region. According to the analysis in the previous section, we then see that soft gluon couplings do not arise in the leading contributions.

Figure 7: Pure gluonic contributions to DIS. Left: The black squares indicate transversely polarized gluons while all other gluons are longitudinally polarized. Right: Longitudinally polarized gluons only give a super-leading contribution in the hard scattering region.

We notice that one may also consider pure gluon exchanges between the upper and lower parts. If all gluons are longitudinally polarized, i.e. contributing via $N^{+-}$, then a super-leading contribution arises which has power $Q^2/m^2$ relative to the leading case. However, Ward identities apply for these contributions, and a careful treatment shows that the super-leading piece actually cancels, leaving behind a remainder term that is only leading [71]. A leading contribution is also obtained when one of the gluons on each side of the cut is transversely polarized; we show this in figure 7 (left graph), where we denote the transversely polarized gluons using the black squares. Pure gluon exchange terms are important for the analysis in the small-x region, which we come back to later.

The parton model result reviewed above can be exactly reproduced in a model field theory which is non-gauge (this removes all gauge boson attachments between $L$ and $U$) and super-renormalizable (this implies that the hard part $U$ is trivial as in figure 1). As a simplified case we instead imagine a theory which is still non-gauge but is renormalizable. This means that the higher order corrections to the hard part are no longer power suppressed. Moreover it means that one also has to take into account the UV renormalization. At the same time it implies that the gauge boson exchanges shown in figure 6 are absent, and one obtains instead figure 8. Now, another way to think of this case is to actually consider full QCD in light-cone gauge $A^+ = 0$. In this case the leading gluon coupling vanishes since

$$N^{-+}(k) = g^{-+} - \frac{k^+ n^-}{k\cdot n} = 1 - 1 = 0. \qquad (3.14)$$

Therefore in figure 5, all gluon couplings again vanish to leading power. In figure 7 it means on the other hand that only the two transversely polarized gluons remain, as shown in figure 8.

Figure 8: Leading contribution in the simplified case in non-gauge theory (only left graph) or in light-cone gauge QCD (both graphs).

A factorization formula for figure 8 can now be obtained rather easily by assuming that there is a clear separation in momenta for the exchanged line $k$, namely that it can either be hard or collinear to $P$. We can then write the hadronic tensor as (neglecting photon indices)

$$W = \sum_{\{a\}}\int\frac{d^{4-2\epsilon}k}{(2\pi)^{4-2\epsilon}}\,U^{\{a\}}(k,q)\,L_{\{a\}}(k,P), \qquad (3.15)$$

where the index $\{a\}$ collectively denotes all relevant labels such as flavor, color, polarization^. We again make the approximation of replacing $k$ in $U$ by $\hat k = (k^+, 0, 0_\perp)$. Thus one gets

$$W \approx \sum_{\{a\}}\int dk^+\,U^{\{a\}}(\hat k,q)\int\frac{dk^-\,d^{2-2\epsilon}k_\perp}{(2\pi)^{4-2\epsilon}}\,L_{\{a\}}(k,P). \qquad (3.16)$$



This formula is not yet in a fully factorized form, however, since there is still the sum over the labels $\{a\}$. We note that $U$ must be diagonal in the color indices since the photon is a color singlet. Consider first the quark contribution shown in figure 8 (left graph). To fully factorize $F$ we can then apply exactly the same argument as in the parton model case in going from equation (3.3) to (3.4). We then get, just as in (3.4),

$$W \approx \sum_j\int dk^+\;\mathrm{Tr}\left[U_j(\hat k,q)\,\frac{\gamma^-}{2}\right]\mathrm{Tr}\left[\int\frac{dk^-\,d^{2-2\epsilon}k_\perp}{(2\pi)^{4-2\epsilon}}\,\frac{1}{2}\,\gamma^+ L_j(k,P)\right]. \qquad (3.17)$$

Summation over the color indices in $L$ is kept implicit. Corrections to the factorization formula are power suppressed by the analysis in section 3.1.2.



^Of course in a non-gauge theory we need not consider the color indices, but as the analysis is also relevant for light-cone QCD we include all quantum labels.

Figure 9: Example of subtraction in the NLO gluon coefficient. The subtraction removes the contribution where the loop momentum $l$ is target-collinear, indicated by $l$ in the last graph.



For the gluon contribution shown in the right graph of figure 8 we instead find

$$W \approx \int dk^+\,U^{jj'}(\hat k,q)\int\frac{dk^-\,d^{2-2\epsilon}k_\perp}{(2\pi)^{4-2\epsilon}}\,L_{jj'}(k,P). \qquad (3.18)$$

We then notice that the upper part $U$ is diagonal in the transverse and color indices, which gives the factorized form

$$W \approx \int\frac{d\xi}{\xi}\,U^{jj}(\hat k,q)\left[\xi P^+\int\frac{dk^-\,d^{2-2\epsilon}k_\perp}{(2\pi)^{4-2\epsilon}}\,L_{jj}\big((\xi P^+,k^-,k_\perp),P\big)\right]. \qquad (3.19)$$

The second factor here defines, preliminarily, the integrated gluon distribution. We shall see in a later section that the elementary definition of the TMD gluon distribution in axial gauge in the small-x limit is given by the very same set of approximations.

This simple derivation of factorization cannot be strictly true, however. Namely, the main assumption that a clear separation of scales is possible is not generally true in a renormalizable theory like QCD. For example, in the above calculation we assume that $k_\perp \sim m$, while the case $k_\perp \sim Q$ would instead have contributed to the next-to-leading order correction to the hard part $H$. There is, however, also an intermediate region, where $m \ll k_\perp \ll Q$, and $k$ is neither exactly target-collinear nor exactly hard; as a consequence it is not clear in the above formalism how exactly to handle $k$ in this case. For the assumptions above to hold, it must therefore be true that this intermediate region can be safely omitted. This is, however, not the case. In fact, the renormalizability of QCD implies that there are in general logarithmic contributions,

$$\int^{Q^2}\frac{dk_\perp^2}{k_\perp^2} \sim \ln\frac{Q^2}{m^2}. \qquad (3.20)$$
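
To make the weight of the intermediate region explicit, one can split the logarithmic integral at two illustrative scales. The separators $\mu_1$ and $\mu_2$ below are hypothetical, with $m \ll \mu_1 \ll \mu_2 \ll Q$, and the lower limit $m^2$ is likewise an assumed regulator of the collinear end; none of these are fixed by the text.

```latex
% \mu_1, \mu_2 and the lower limit m^2 are illustrative assumptions.
\begin{align*}
\int_{m^2}^{Q^2}\frac{dk_\perp^2}{k_\perp^2}
  &= \underbrace{\int_{m^2}^{\mu_1^2}\frac{dk_\perp^2}{k_\perp^2}}_{\text{near-collinear}}
   + \underbrace{\int_{\mu_1^2}^{\mu_2^2}\frac{dk_\perp^2}{k_\perp^2}}_{\text{intermediate}}
   + \underbrace{\int_{\mu_2^2}^{Q^2}\frac{dk_\perp^2}{k_\perp^2}}_{\text{near-hard}} \\
  &= \ln\frac{\mu_1^2}{m^2} + \ln\frac{\mu_2^2}{\mu_1^2} + \ln\frac{Q^2}{\mu_2^2}
   = \ln\frac{Q^2}{m^2}.
\end{align*}
```

Each interval contributes in proportion to its logarithmic width, so taking $\mu_1$ a fixed multiple of $m$ and $\mu_2$ a fixed fraction of $Q$ leaves the growing part of the logarithm in the intermediate piece, which therefore cannot be discarded as a power correction.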



There is therefore no power suppression of the intermediate region, and in fact it is 
even enhanced by a logarithm. A full treatment must therefore treat such regions correctly, 
and this can in general be done by a subtractive formalism [1]. This means that each PSS is 
defined with subtractions of the smaller PSSs that it contains, to prevent double counting 
and ensure that it indeed is dominated by the momenta associated with it. For the hard 
part $U$ in figure 8, one should therefore include a subtraction of the target-collinear PSS.
We show examples of these subtractions in DIS for the gluon and quark contributions in 




Figure 10: Example of subtraction in the NLO quark coefficient. The subtraction removes the contribution where the loop momentum $l$ is target-collinear, indicated by $l$ in the last graph.



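figures 9 and 10 respectively. If we denote by $d\Pi$ the phase space measure for the momenta contained in $U$, then a more correct version of (3.19) reads

$$W \approx \int\frac{d\xi}{\xi}\left[\int d\Pi\;U^{jj}(\hat k,q) - \text{subtractions}\right]\left[\xi P^+\int\frac{dk^-\,d^{2-2\epsilon}k_\perp}{(2\pi)^{4-2\epsilon}}\,L_{jj}\big((\xi P^+,k^-,k_\perp),P\big)\right]. \qquad (3.21)$$

The integrated (bare) gluon distribution is thus given by

$$f_g^{(0)}(\xi) = \xi P^+\int\frac{dk^-\,d^{2-2\epsilon}k_\perp}{(2\pi)^{4-2\epsilon}}\,L_{jj}\big((\xi P^+,k^-,k_\perp),P\big) = \int\frac{dx^-}{2\pi\,\xi P^+}\,e^{-i\xi P^+x^-}\,\langle P|\,F^{+j}_{(0)}(0^+,x^-,0_\perp)\,F^{+j}_{(0)}(0)\,|P\rangle, \qquad (3.22)$$

where the last result holds in $A^+ = 0$ gauge in QCD, apart from some technical problems associated with this gauge that we are neglecting.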



As indicated in (3.22), the basic operator definitions of the parton distributions are for 
the bare fields of the Lagrangian. Note that it is these fields which have the canonical gauge 
transformation properties, and thus in discussing the gauge transformation properties of 
the parton distributions one necessarily refers to the operator definitions constructed out 
of the bare fields. The renormalization of the bare parton distributions is then an issue of 
the renormalization of non-local operators. While in the case of local field operators, the 
renormalization factor can be taken as a multiplicative constant which is independent of 
momenta and masses, for the non-local operators appearing in the definitions of the bare 
parton distributions one instead finds that there is a convolution with a renormalization 
factor. Basically if we denote the bare parton distribution for a parton of flavor $j$, as obtained from either (3.17) or (3.19), by $f_j^{(0)}(\xi)$, and the renormalized distribution by $f_j(\xi)$, we find

$$f_j(x;\mu) = \lim_{\epsilon\to 0} Z_{jj'}(\xi,\mu,\epsilon)\otimes f^{(0)}_{j'}(x/\xi;\mu,\epsilon), \qquad (3.23)$$

where the convolution is an integral in $\xi$ as in (3.18) and (3.19). The evolution of $f_j(x;\mu)$ with respect to $\mu$ is given by the DGLAP equations.

3.1.4 Including the gluons, and the Glauber region 

For a fully satisfactory treatment of factorization in full QCD one needs, however, to deal with the gluon emissions. As we recall from section 3.1.2, in QCD we can without any power suppression exchange arbitrarily many longitudinally polarized gluons between the hard and collinear regions, and between the soft and collinear regions. We indicated this possibility already in earlier figures. In the previous section we argued that, at least in the collinear factorization of inclusive DIS, the structure of the leading graphs can be simplified by choosing the light-cone gauge $A^+ = 0$, which eliminates the leading longitudinally polarized gluons.

There is, however, a good reason to try to avoid the light-cone gauge in the generic treatment (see also section 4.5 and others below). Note from the arguments in the previous sections that the treatment of factorization is based on first analyzing the analytic structure of the Feynman graphs, identifying the PSSs, and then using power counting to extract the leading PSSs. To guarantee that the power counting arguments work properly, contour deformations must be performed when necessary. In particular, if $k$ is a momentum in the soft region, then there is the possibility that the components of $k$ do not all scale with the same power $\lambda_S$, but that the longitudinal components $k^+$ and $k^-$ might be parametrically much smaller than $k_\perp$. This happens if $k^+$ or $k^-$ is pinched by the collinear lines it attaches to. For example, if $k$ couples to a collinear line $p_A$ then a propagator

$$\frac{1}{(p_A+k)^2+i\epsilon} \qquad (3.24)$$

arises. The pole for $k^-$ is then

$$k^- \sim -\frac{m^2}{Q} - i\epsilon. \qquad (3.25)$$

Thus $k^-$ is parametrically much smaller than $\lambda_S \sim m$. When this happens, we say the momentum is in the Glauber region, $k^+k^- \ll k_\perp^2$. Now, if no other such pole is present, or if all such poles lie in the same half of the complex plane (all below or all above the real axis), then we can deform the contour away from this pole to keep $k^- \sim \lambda_S$. If, however, another pole exists simultaneously, such that

$$k^- \sim \frac{m^2}{Q} + i\epsilon, \qquad (3.26)$$

then the $k^-$ contour is pinched, and cannot be deformed. It might still be possible to deform on $k^+$, but if not, then the standard power counting fails. The longitudinal polarizations then no longer dominate and one cannot use the eikonal approximations needed to obtain factorization.
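
The location and sign of the poles that produce the pinch can be made explicit with a short computation. This is a sketch under assumptions not spelled out in the text: the light-cone convention $v^2 = 2v^+v^- - v_\perp^2$, collinear momenta with plus components of order $Q$, minus components of order $m^2/Q$ and transverse components of order $m$, a soft $k$ with $k_\perp \sim m$ and $|k^+| \ll Q$, and a second collinear line $p_A'$ (a hypothetical label) through which $k$ flows in the opposite sense.

```latex
% \epsilon' denotes \epsilon divided by a positive quantity of order Q; p_A' is an illustrative second line.
\begin{align*}
(p_A+k)^2 + i\epsilon
  &= 2\,(p_A^+ + k^+)(p_A^- + k^-) - (p_{A\perp}+k_\perp)^2 + i\epsilon = 0 \\
\Longrightarrow\quad
k^- &= -p_A^- + \frac{(p_{A\perp}+k_\perp)^2 - i\epsilon}{2\,(p_A^+ + k^+)}
     = \mathcal{O}\!\left(\frac{m^2}{Q}\right) - i\epsilon' ,\\[4pt]
(p_A'-k)^2 + i\epsilon = 0
\quad\Longrightarrow\quad
k^- &= p_A'^{\,-} - \frac{(p'_{A\perp}-k_\perp)^2 - i\epsilon}{2\,(p_A'^{\,+} - k^+)}
     = \mathcal{O}\!\left(\frac{m^2}{Q}\right) + i\epsilon' .
\end{align*}
```

The two poles sit at $|k^-| \sim m^2/Q$ on opposite sides of the real axis, so the $k^-$ contour is trapped between them and cannot be moved out to $k^- \sim \lambda_S$.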

The use of the light-cone gauge implies that the analytic structure of the individual Feynman graphs is altered, since now an additional pole $1/k^+$ is introduced with each propagator. This has obvious implications for the factorization proofs. These poles might for example introduce pinch points that are not present in a covariant gauge. Moreover, the gauge poles $1/k^+$ commonly give rise to integrals of the form

$$\int dk^+\,\frac{1}{k^+}\,I(k^+,k_\perp), \qquad (3.27)$$

and these diverge as $k^+ \to 0$. Notice that the divergences arise from end point singularities and therefore cannot be treated by any $i\epsilon$ prescription or principal value. In fact there exists no generalized function which is a "canonical regularization", in the sense described in [72], of this integral.



These divergences are in fact the rapidity divergences we mentioned in earlier sections. They also arise when the eikonal approximation is used in a covariant gauge. In the integrated distribution, there is actually a cancellation between real and virtual terms, which means that in (3.27)

$$\int d^2k_\perp\,I(k^+=0,k_\perp) = 0. \qquad (3.28)$$

This leads to the well-known "plus prescription", $\left(\frac{1}{k^+}\right)_+$. In TMD distributions, however, no cancellation occurs, since $I(0,k_\perp) \neq 0$, and the light-cone gauge therefore introduces problems. The light-cone gauge is moreover not useful when several different collinear directions are relevant.
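
To see how the real-virtual cancellation tames the end point in the integrated case, it helps to write the $k_\perp$-integrated integrand explicitly. The sketch below uses the shorthand $J(k^+)$ and an upper limit $k^+_{\max}$, both introduced here only for illustration, and assumes that $J$ is smooth near $k^+ = 0$.

```latex
% J and k^+_{max} are illustrative names, not notation from the text.
\begin{align*}
J(k^+) &\equiv \int d^2k_\perp\, I(k^+,k_\perp), \qquad J(0) = 0 \ \text{ by (3.28)},\\
\int_0^{k^+_{\max}} \frac{dk^+}{k^+}\, J(k^+)
   &= \int_0^{k^+_{\max}} \frac{dk^+}{k^+}\,\bigl[J(k^+) - J(0)\bigr]
    \;<\; \infty
   \qquad\text{since } J(k^+) - J(0) = \mathcal{O}(k^+) \text{ as } k^+\to 0 .
\end{align*}
```

At fixed $k_\perp$ no such subtraction is available, because $I(0,k_\perp) \neq 0$, and the integral over $k^+$ keeps its end-point divergence.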

The general method for factorizing the arbitrary order gluon couplings between the different regions is based on exploiting the gauge symmetries of the leading terms and on using Ward identities (Slavnov-Taylor-Ward identities). The basic technique can be understood as follows. In Feynman gauge, let $k$ be a soft gluon coupling the regions $S$ and $A$. We then have a contribution of the type

$$A^\mu(k,p_A)\,g_{\mu\nu}\,S^\nu(k). \qquad (3.29)$$

Generally of course there will be many other couplings, and $A$ and $S$ will depend on additional momenta, but that does not matter for the approximation we are explaining. The leading contribution is then

$$A^\mu(k,p_A)\,g_{\mu\nu}\,S^\nu(k) \approx A^+(k_B,p_A)\,S^-(k) = A^\mu(k_B,p_A)\,\frac{k_{B,\mu}\,n_{A,\nu}}{k\cdot n_A}\,S^\nu(k), \qquad (3.30)$$

where

$$k_B = (k\cdot n_A)\,n_B = (0^+, k^-, 0_\perp). \qquad (3.31)$$

Figure 11: Factorized structure in inclusive DIS in covariant gauge. The longitudinal gluon emissions are factorized into eikonal Wilson lines (double lines) to provide gauge invariant definitions of the parton distributions. Left: Quark distribution. Right: Gluon distribution where the gluons with black squares are transversely polarized gluons.



Here $n_A$ is a light-like vector in the direction of $p_A$, with $n_A\cdot V = V^-$ for any $V$. Thus $k_B\cdot n_A = k\cdot n_A$. Since now the polarization of the gluon $k$ is multiplied by its momentum in the coupling to $A$, Ward identities can be applied. The eikonal denominator in (3.30) gives a contribution in $S$ from a Wilson line. The all-order gluon couplings between $A$ and $S$ can then be successively factorized into a Wilson line contribution in $S$.
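
The second equality in (3.30) can be verified component by component. The sketch assumes light-cone components with $V\cdot W = V^+W^- + V^-W^+ - V_\perp\cdot W_\perp$ and the explicit choices $n_A = (1,0,0_\perp)$ and $n_B = (0,1,0_\perp)$ in $(+,-,\perp)$ notation; these normalizations are assumptions consistent with $n_A\cdot V = V^-$, but they are not spelled out in the text.

```latex
% Assumed: n_A = (1,0,0_\perp), n_B = (0,1,0_\perp) in (+,-,\perp) components.
\begin{align*}
k\cdot n_A &= k^-, \qquad k_B = (k\cdot n_A)\,n_B = (0,\,k^-,\,0_\perp),\\
A(k_B,p_A)\cdot k_B &= A^+(k_B,p_A)\,k^-, \qquad n_A\cdot S(k) = S^-(k),\\
A^\mu(k_B,p_A)\,\frac{k_{B,\mu}\,n_{A,\nu}}{k\cdot n_A}\,S^\nu(k)
  &= \frac{A^+(k_B,p_A)\,k^-\;S^-(k)}{k^-}
   = A^+(k_B,p_A)\,S^-(k).
\end{align*}
```

The rewriting costs nothing at leading power, but it puts the gluon momentum $k_B$ in contraction with $A$, which is the form a Ward identity acts on.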

One can similarly make approximations for the $H$-to-$A$ couplings. The eikonal terms that arise are then absorbed into $A$ to provide gauge invariant definitions of the basic parton distributions (or fragmentation functions). An example in the case of inclusive DIS is shown in figure 11, where the Wilson lines are indicated by double lines. The procedure of using the Ward identities for extracting the gluon exchanges between the different regions is very much the same whether one is formulating collinear factorization or TMD factorization.

As we have seen, Wilson lines appear in the small-x formalisms as well, both in the Weizsäcker-Williams distribution (2.5) and the dipole distribution (2.10). It is then rather important to understand the exact structure and derivation of these lines, in particular since differences appear between the dipole definition and the TMD distributions. We analyze these points in detail in [42].



3.1.5 TMD factorization 

In the hard scattering formalism, the need for TMD factorization becomes obvious when one considers observables which are more sensitive to the exact kinematics of the final state. A typical example concerns the almost back-to-back production of hadrons [4] in $e^+e^-$ annihilation shown in figure 12. Other relevant processes where one needs to consider TMD factorization are single-inclusive hadron production at low $p_\perp$ in DIS (SIDIS), also shown in figure 12, and Drell-Yan lepton pair production, shown in figure 13, where the total transverse momentum of the lepton pair is much smaller than the hard scale. In all these cases the kinematics is sensitive to low values of the observable transverse momentum $q_\perp$, and one therefore cannot neglect any of the transverse momenta flowing through the regions $C_A$, $C_B$ and $S$, as doing so would significantly change the kinematics of the observable final state products. If on the other hand the relevant transverse momentum observables are large, of the order of the hard scale $Q$, then the effect of the transverse momentum flowing out from the collinear regions via the soft region is power suppressed and can be neglected. In that case one obtains the standard integrated (collinear) factorization.

Figure 12: Processes where TMD factorization is relevant. Left: Di-hadron production in $e^+e^-$. Right: Hadron production in SIDIS.

Figure 13: Leading regions for TMD factorization in Drell-Yan lepton pair production.

Note, however, that the transverse momentum flowing directly into the hard part $H$ from the collinear regions $C_A$ and $C_B$ can still be neglected, since the error involved in this approximation is of order $q_\perp/Q$, which is small in the validity region of TMD factorization. As $q_\perp \to Q$ the TMD formula loses its accuracy, but then one enters the region where ordinary integrated factorization is valid. When $q_\perp \sim Q$, the transverse momentum must be a part of the hard region; physically it corresponds to the case where several high-$q_\perp$ partons emerge from $H$. Thus what determines the need for TMD parton distributions and fragmentation functions is the kinematics of the final state. The momenta entering $H$ from the collinear region $C_A$ or $C_B$ can still be approximated to be on-shell, even in the case of TMD factorization. This is somewhat different from the small-x formulation, where the gluon momentum entering the hard scattering (if there is any) is off-shell, its virtuality being determined by the transverse momentum.

The factorization formula in the case of hadron pair production in $e^+e^-$ annihilation involves the transverse momentum convolution of two fragmentation functions (since there is no hadronic initial state in this process). The factorized formula for the relevant hadronic tensor is obtained by applying the appropriate Ward identities for the longitudinally polarized gluons exchanged between the leading regions shown in figure 12 (left graph). If the momenta entering regions $C_A$, $C_B$ and $S$ are denoted respectively by $k_A$, $k_B$ and $k_S$, then the factorized formula is given by (we denote $C_A$ by $A$, and $C_B$ by $B$ for clarity)

$$W^{\mu\nu} = \int d^4k_A\,d^4k_B\,d^4k_S\;A(k_A)\,B(k_B)\,S(k_S)\,H^{\mu\nu}(q)\,\delta^{(4)}(q-k_A-k_B-k_S). \qquad (3.32)$$

The delta function can be used to fix $k_{S,\perp}$, $k_A^+$ and $k_B^-$. One furthermore makes the approximation of ignoring $k_A^-$ ($k_B^+$) everywhere but in $A$ ($B$), and ignoring $k_S^\pm$ everywhere but in $S$. These approximations are allowed since the corrections are power-suppressed at least as $m^2/Q^2$. The integrals over these variables can then all be short circuited and one gets

$$W^{\mu\nu} = \int d^2k_{A,\perp}\,d^2k_{B,\perp}\left(\int dk_A^-\,A(k_A)\right)\left(\int dk_B^+\,B(k_B)\right)\left(\int dk_S^+\,dk_S^-\,S(k_S)\right)H^{\mu\nu}(q)$$
$$= \int d^2k_{A,\perp}\,d^2k_{B,\perp}\;A(z_A,k_{A,\perp})\,B(z_B,k_{B,\perp})\,S(q_\perp-k_{A,\perp}-k_{B,\perp})\,H^{\mu\nu}(q). \qquad (3.33)$$

Each respective factor in the parentheses gives the basic operator definition of the fragmentation functions and the soft factor. We mentioned in sections 3.1.2 and 3.1.3 that each given PSS contains subtractions of the smaller PSSs. Thus the collinear factors $A$ and $B$ in (3.33) contain subtractions of the soft region. Now, the unsubtracted collinear parts contain Wilson lines which arise from the factorized gluon couplings to the hard part $H$. This is done by using the approximation in (3.7), rewriting this as in (3.30) and applying the Ward identities. For the $A$-to-$H$ couplings, the approximated momenta from (3.7) are $k_A = (k^+, 0^-, 0_\perp) = (k\cdot n_B)\,n_A$ and therefore we get a Wilson line in the direction $n_B$:

$$W(x; n_B) = P\exp\left(-ig_0\int d\lambda\; A(x+n_B\lambda)\cdot n_B\right). \qquad (3.34)$$

For the $B$ part we instead get a Wilson line in the direction $n_A$. In figure 14 we graphically represent the unsubtracted collinear part, including the Wilson line (3.34) shown by double lines, for both a parton distribution (top two graphs) and a fragmentation function (bottom two graphs). The color representation of the Wilson line (3.34) is determined by the particle at the end of the double lines in figure 14: fundamental for a quark (top and bottom left), adjoint for a gluon (top and bottom right).

Figure 14: Graphical representation of the unsubtracted collinear part after the gluon couplings to the hard part have been factorized into Wilson lines in the direction $n_B$. Left: Quark distribution. The black squares indicate transversely polarized gluons. Top: Collinear part in a parton distribution. Bottom: Collinear part in a fragmentation function.

Figure 15: The factorized soft part. On each side of the cut, the gluons that couple to regions $A$ and $B$ are factorized into Wilson lines in the directions $n_A$ and $n_B$ respectively.



The soft gluons are similarly summed into Wilson lines using (3.30). From the $A$ side
we get a line in the direction of $u_A$, while from the $B$ side we instead get a line in
the direction of $u_B$. The definition of the collinear part always involves the hadron state
$|P\rangle$, either as incoming (for a parton distribution) or as outgoing (for a fragmentation
function). The soft factor on the other hand does not contain such a hadron, so it is defined
as a vacuum expectation value, which we represent in figure 15.

As seen from (3.33), it is convenient to make a Fourier transform into the transverse
coordinate $b_\perp$ to obtain

W^{\mu\nu} = \int d^{2}b_\perp\, e^{-i q_\perp \cdot b_\perp}\, A(z_A, b_\perp)\, B(z_B, b_\perp)\, S(b_\perp)\, H^{\mu\nu}(q) (3.35)

which is simpler than the momentum convolution written above.

Figure 16: The soft factor absorbed into the unsubtracted parton distributions and fragmentation
functions. In the final result, $n_A$ and $n_B$ can be taken exactly light-like since the rapidity
divergences cancel those in the unsubtracted collinear factor. The vector $n_C$ cannot be taken
light-like, however.
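
The step from the momentum-space convolution (3.33) to the product form (3.35) is the
convolution theorem in two transverse dimensions. The following is only a sketch: it assumes
the convention $\tilde F(b_\perp) = \int d^2k_\perp\, e^{i k_\perp \cdot b_\perp} F(k_\perp)$ for each factor,
suppresses the $z$ arguments, and drops overall powers of $2\pi$, none of which is fixed by
the text above.

```latex
% Assumed convention (not fixed by the text): \tilde F(b) = \int d^2k\, e^{i k\cdot b} F(k).
% The innermost integral uses the shift l_\perp = q_\perp - k_{A\perp} - k_{B\perp}.
\begin{align*}
\int d^2q_\perp\, e^{i q_\perp\cdot b_\perp}
  \int d^2k_{A\perp}\, d^2k_{B\perp}\;
  A(k_{A\perp})\, B(k_{B\perp})\, S(q_\perp - k_{A\perp} - k_{B\perp})
&= \int d^2k_{A\perp}\, e^{i k_{A\perp}\cdot b_\perp} A(k_{A\perp})
   \int d^2k_{B\perp}\, e^{i k_{B\perp}\cdot b_\perp} B(k_{B\perp})
   \int d^2l_\perp\, e^{i l_\perp\cdot b_\perp} S(l_\perp) \\
&= \tilde A(b_\perp)\, \tilde B(b_\perp)\, \tilde S(b_\perp).
\end{align*}
```

Inverting the transform then gives an integral over $b_\perp$ with the phase $e^{-i q_\perp \cdot b_\perp}$
multiplying the product of the three coordinate-space factors, which is the structure of (3.35).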



In the final definition, the soft factor is absorbed completely into the collinear factors
to define the final subtracted fragmentation functions given by [1]

D_{H_A/f}(z_A, b_\perp; \zeta, \mu) = D^{\rm unsub}_{H_A/f}(z_A, b_\perp; n_B) \times \sqrt{ \frac{ S(b_\perp; n_A, n_C) }{ S(b_\perp; n_A, n_B)\, S(b_\perp; n_C, n_B) } } \times Z. (3.36)



Here $n_A$ and $n_B$ are taken light-like, and $Z$ is the UV renormalization factor. The somewhat
strange-looking factor in the square root is shown in figure 16. The precise motivation for it
is described in detail in [1, Ch. 13]. The final definition is free from divergences associated
with Wilson line self-energy corrections. The vector $n_C$ defining the directions of the Wilson
line in the soft factors serves as the rapidity cut-off, which we indicate by the $\zeta$ dependence
of the fragmentation function. The unsubtracted factor $D^{\rm unsub}$ is given exactly by the factors
in figure 14 (bottom left graph in the current example), defined in addition with an integral over
$k^-$ as in (3.33), and the Fourier transform from $k_\perp$ to $b_\perp$. A similar definition applies for
the second fragmentation function associated with the region $B$. The final factorization
formula then reads



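W^{\mu\nu} \propto \frac{z_A z_B}{Q^2}\, H_f^{\mu\nu}(Q; \mu) \int d^{2}b_\perp\, e^{-i q_\perp \cdot b_\perp}\, D_{H_A/f}(z_A, b_\perp; \zeta, \mu)\, D_{H_B/\bar f}(z_B, b_\perp; \zeta, \mu), (3.37)

where $z_{A,B} = p_{A,B}/k_{A,B}$, and

H_f^{\mu\nu} = \mathrm{Tr}\, \slashed{k}_A H_f^{\nu}\, \slashed{k}_B \bar{H}_f^{\mu}. (3.38)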



$H_f^{\nu}$ and $\bar{H}_f^{\mu}$ stand for the hard blobs shown in figure 12, defined to be irreducible in the
collinear lines, and containing subtractions of the collinear and soft regions, just like in



(|3^ 



The tensor $W^{\mu\nu}$ of course cannot depend on the rapidity cut-off $\zeta$, and this requirement
is embedded in the Collins-Soper evolution equation of the fragmentation functions with
respect to $\zeta$. In SIDIS we instead have a convolution of one parton distribution (for the
incoming target hadron) and one fragmentation function (for the final state hadron).



W^{\mu\nu} \propto H_f^{\mu\nu}(Q; \mu) \int d^{2}b_\perp\, e^{-i q_\perp \cdot b_\perp}\, f_{f/H_A}(x, b_\perp; \zeta, \mu)\, D_{H_B/f}(z, b_\perp; \zeta, \mu) (3.39)

where $H_f^{\mu\nu}$ is given by the same expression as in (3.38) (but of course the hard factors
$H_f^{\nu}$ and $\bar{H}_f^{\mu}$ are different in $e^+e^-$ and DIS), and $x = k_A^+/p_A^+$ and $z = p_B^-/k_B^-$. Thus the
change is that one fragmentation function is simply exchanged for the parton distribution function
of the target hadron. The parton distribution $f$ is defined exactly as in (3.36) to include
the soft factors; one simply needs to change $D^{\rm unsub}$ to $f^{\rm unsub}$, which means (for quarks)
replacing the bottom left graph in figure 14 with the top left one.

Finally, in the Drell-Yan process we instead have two parton distributions and there is
no fragmentation function since the observed final state is leptonic. Thus

W^{\mu\nu} \propto s\, H_f^{\mu\nu}(Q; \mu) \int d^{2}b_\perp\, e^{-i q_\perp \cdot b_\perp}\, f_{f/H_A}(x_A, b_\perp; \zeta, \mu)\, f_{\bar f/H_B}(x_B, b_\perp; \zeta, \mu) (3.40)

where now the hard coefficient $H_f^{\mu\nu}$ is the tensor for the on-shell partonic reaction
$f\bar{f} \to \gamma^*$. The extra factor $s$ in front of the integral arises from the definition of the
hadronic tensor for the Drell-Yan process, which reads

W^{\mu\nu} = s \int d^{4}x\, e^{i q \cdot x}\, \langle p_A, p_B | J^{\mu}(x) J^{\nu}(0) | p_A, p_B \rangle. (3.41)

In order to obtain a reliable estimate of $H^{\mu\nu}$ it is optimal to let $\mu \sim Q$ so as to
avoid large logarithms. The higher order corrections are then subleading in factors of
$\alpha_s(\mu \sim Q) \ll 1$ without any logarithmic enhancements, and thus fixed order perturbative
calculations are reliable. Notice again that in all formulas above, the hard tensor $H^{\mu\nu}$ is
always outside the transverse momentum (or coordinate) integral and the lines entering it
are on-shell.

Thus we see that the TMD parton distributions or fragmentation functions, compared
to the basic parton model definitions, depend additionally on the variables $\zeta$ and $\mu$. They
consequently satisfy evolution equations with respect to both these variables. The evolution
in $\mu$ is given by the standard DGLAP equations while the evolution with respect to the
rapidity variable $\zeta$ is given by the Collins-Soper (CS) evolution equation [1]. The CS
kernel controlling the rapidity evolution is the same for all the above reactions because it
is determined by the soft factor, which is the same in all the above examples.

We have above outlined the fundamentals of factorization in QCD, in processes where
a hard scale $Q$ is present, and where the collinear directions scale with $Q$. In the small-$x$
region a hard scale may or may not be present. The traditional process to study is
small angle two-particle elastic scattering where the momentum transfer $t$ is much smaller
than the cms energy $s$, and where the collinear momenta scale with $\sqrt{s}$. In this case the
hard region, if present, has a scale $Q$ which is fixed, and is therefore not proportional to
the asymptotic variable $\sqrt{s}$. The leading regions are therefore somewhat different than in
the hard scattering factorization. We will outline the relevant regions for the small-$x$ case
in section 4, where we examine single inclusive particle production. We now go through
the main formulations of $k_\perp$-factorization in the BFKL and CGC formalisms, and compare
these to the hard scattering case just discussed.

Figure 17: The multi-Regge-factorized form of the scattering amplitude in BFKL.

Figure 18: Graphs contributing to Lipatov vertex.

3.2 Factorization in BFKL

"Factorization" in the BFKL formalism refers to the Regge factorization in which a given
$2 \to n$ scattering amplitude is, in the asymptotic limit $s \to \infty$, written as a factorized
product of effective vertices and couplings of "reggeized gluons". This is known as the
"multi-Regge form". The arguments for the factorized form of the $2 \to n$ amplitudes go
back to the pre-QCD days of Regge theory, and the so-called "multi-peripheral" models
[62-64,73].

We illustrate the multi-Regge form in figure 17. Here the zig-zag lines denote the
Reggeons, and each black circle denotes the Reggeon-Reggeon-gluon vertex. Figure 17 is
in the Regge theory valid when $s_i \to \infty$ for all $i$ [63,64].

In BFKL, the vertical zig-zag lines in figure 17 are given by gluons whose propagators
(in Feynman gauge) are obtained by




where 



PA<1^) = -^ ^ P,M^^ ^0 = ^T^ ( ^ 1 (3-42) 



u.(,l,) - 1 = a.N^ I ['J).+.t,^(,/^;^^^)2 ^ (3-43) 



The function $\omega$ is called the "gluon Regge trajectory". The vertices in figure 17 are given by
the Lipatov vertex, which is an effective three-gluon vertex. The Lipatov vertex is derived
from the tree-level graphs of the $2 \to 3$ partonic amplitude shown in figure 18. The external
partons may be quarks or gluons; the use of the eikonal approximations implies that the
vertex is independent of the flavor of these particles (at least when the external particles
are individual quarks or gluons).

The fundamental assertion of the BFKL formalism is that the multi-Regge form shown
in figure 17 is valid for all $2 \to n$ amplitudes. It has been argued in reference [74] that the
multi-Regge result can be shown to be correct to all orders, once it has been shown to be
correct to one-loop order for all $2 \to n$ amplitudes, by essentially using the same techniques
($s$-channel unitarity relations) developed in Regge theory in [63,64]. We are, however, not
aware of any explicit higher order calculations in QCD of the $2 \to n$ amplitudes for $n > 3$.
For the $2 \to 3$ amplitude, the multi-Regge form has been derived in reference [75] up to
one-loop corrections to the graphs in figure 18.



As we saw in the previous section, factorization has to be shown to hold to all orders.
In section 4 we shall show some examples of higher order corrections where TMD
factorization is known to be violated. Since the multi-Regge formula leads to a $k_\perp$-factorized
form (see further [42]) it is of relevance to consider such higher order graphs. As we will
see, the breakdown of factorization might be hidden until higher order corrections. For
example, figures 34, 36 and 37 show that factorization breakdown is not visible until 4
and 5 gluon exchange in the $2 \to 2$ amplitudes. If we consider only one side of the cut
$2 \to 2$ amplitude, then factorization breaking graphs appear in 2 or 3 loop corrections to
the $2 \to 4$ amplitude. Similar factorization breaking terms might be present in the $2 \to 3$
gluon amplitudes at 2 loop corrections as well. It may therefore very well be that one-loop
corrections do not exhibit any TMD factorization breaking.

3.3 Factorization in the CGC 

The Color Glass Condensate (CGC) [10-15] is a semi-classical approach developed to deal
with the QCD physics of "large" objects such as heavy ions.

The set-up of the CGC formalism is rather different from the hard scattering
factorization. The main assertion here is that the color degrees of freedom of a given hadron,
such as a large nucleus, can be described by classical fields generated by a distribution of
random color sources, $\rho_a$ ($a$ being the color index), which arise due to the "fast" moving
partons, i.e., those partons which are in the collinear region. These then act as sources for
the softer gluons whose dynamics depend on the classical sources.

3.3.1 Basics of CGC

The classical fields generated by these sources are determined by the solutions to the
classical equations of motion

D_\mu F^{\mu\nu}(x) = J^{\nu}(x) (3.44)

with $D_\mu$ the usual covariant derivative. The generic solutions to (3.44) give classical fields
$A^\mu$ that are highly non-linear in the sources $\rho_a$. In the classical McLerran-Venugopalan
(MV) model [34, 35], the sources are assumed to originate from the valence quarks of the
nucleons which are randomly distributed according to some weight functional, $W[\rho]$. This
is the distribution we encountered earlier in equations (2.3) and (2.5).



In the case of a single particle traveling in the plus direction, the classical current is
taken as

J^{\mu}_a(x) = \delta^{\mu +} g_s\, \rho_a(x^-, x_\perp) (3.45)

where the classical source $\rho(x^-, x_\perp)$ has a very narrow support in $x^-$. In the case of two
particle scattering, with the incoming hadrons traveling along the opposite light-cones, one
takes instead

J^{\mu}_a(x) = \delta^{\mu +} g_s\, \rho_{1,a}(x^-, x_\perp) + \delta^{\mu -} g_s\, \rho_{2,a}(x^+, x_\perp). (3.46)

The model is defined at some scale $\Lambda^\mu$ which sets the applicability of the classical
description. Here $\mu = +$ or $\mu = -$. For a hadron with large momentum $P^\mu$ along the
direction $\mu$, this means that all fields with $k^\mu > \Lambda^\mu$ are taken to be described by the
classical sources $\rho$. The distribution $W[\rho]$ is therefore specified at the scale $\Lambda^\mu$. Physical
quantities of interest in the model are calculated by functional averages using the classical
distribution $W[\rho]$ as in (2.3) for a single hadron, and

\langle O \rangle = \int D\rho_1\, D\rho_2\; W_{\Lambda^+}[\rho_1]\, W_{\Lambda^-}[\rho_2]\; O[\rho_1, \rho_2] (3.47)

in two hadron scattering. Of course, (3.47) is already in a factorized form.



3.3.2 Power counting and "dilute" and "dense" systems 

The treatment of two particle processes is then based on a power counting argument of
the classical sources $g_s\rho$. A "dilute" particle in this power counting is defined to be one
described by a source such that $|g_s\rho| \ll 1$. For such a particle then, in the calculations
only the first order dependence $(g_s\rho)^1$ is kept. Given a functional $O[\rho_1, \rho_2]$ which depends
on both $\rho_1$ and $\rho_2$, expand it as a polynomial

O[\rho_1, \rho_2] = \sum_{n=1}^{\infty} \sum_{m=1}^{\infty} O_{nm}\, (g_s\rho_1)^n (g_s\rho_2)^m. (3.48)

The definition of particle 1 being dilute then means that

O[\rho_1, \rho_2] \to O[\rho_1, \rho_2]\big|_{1,\,\mathrm{dilute}} = \sum_{m=1}^{\infty} O_{1m}\, (g_s\rho_1)(g_s\rho_2)^m. (3.49)

Figure 19: Diagrammatic representation of particle production in "dilute-dilute" scattering in
the language of CGC.



Conversely a particle is defined to be "dense" if it is described by a source satisfying
$|g_s\rho| \sim 1$. In that case, the dependence on $g_s\rho$ is retained to all orders. As for real
particles, a proton or a deuteron is defined as being "dilute", while heavy ions such as gold
or lead nuclei are defined to be "dense". Thus "dilute-dilute" scattering refers essentially
to $pp$ or $p\bar{p}$ scattering, while "dilute-dense" scattering refers to $pA$ or deuteron-nucleus
($dA$) collisions, and finally "dense-dense" scattering refers to $AA$ collisions (lead-lead or
gold-gold). Of course a proton in the CGC becomes "dense" at sufficiently high energies
since the classical sources grow as a function of energy.
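
As an illustration of this power counting, the double expansion (3.48) can be truncated
according to which of the two particles is treated as dilute. The three cases below are simply
the ones named in the text; the coefficients $O_{nm}$ are left unspecified, and the dilute-dilute
line is obtained by applying the truncation of (3.49) to both particles.

```latex
% Truncations of the double series (3.48); O_{nm} are the unspecified expansion coefficients.
\begin{align*}
\text{dilute-dilute } (pp):\quad
  & O[\rho_1,\rho_2] \to O_{11}\,(g_s\rho_1)(g_s\rho_2), \\
\text{dilute-dense } (pA):\quad
  & O[\rho_1,\rho_2] \to \sum_{m=1}^{\infty} O_{1m}\,(g_s\rho_1)(g_s\rho_2)^m, \\
\text{dense-dense } (AA):\quad
  & O[\rho_1,\rho_2] = \sum_{n=1}^{\infty}\sum_{m=1}^{\infty} O_{nm}\,(g_s\rho_1)^n (g_s\rho_2)^m .
\end{align*}
```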

In this setting, the quantum evolution is based on the logic of the leading logarithmic
approximation (LLA) where the coupling $g_s$ is fixed and small, $g_s \ll 1$. Therefore for a
"dilute" object we have $\rho < 1$, while for a "dense" object we have $\rho \sim 1/g_s \gg 1$. These
assumptions lead to the formulation of factorization in the CGC approach [17,18,22-25].

We immediately notice that this power counting is rather different in logic from the
power counting described in section 3.1.2. Here the emphasis is put on the classical source
$\rho(x)$ specified in space-time coordinates. Any correction beyond the classical approximation
is calculated to order $g_s^2$, which amounts to a one-loop calculation. For processes
involving protons then, calculations are kept at linear order in $g_s\rho$ for each proton, which in
a diagrammatic analogy means that at most two gluon couplings are considered. In figure
19 we show an example of single inclusive gluon production in "dilute-dilute" scattering.
Thus in the dilute limit factorization is essentially identical to that in the parton model we
considered in section 3.1.1.

In general, however, the extra gluon emissions between the different PSSs considered
in section 3.1.2 all have small virtualities and they therefore couple strongly. In particular
the soft gluons have all their momentum components small, and the QCD coupling of these
gluons is therefore strong. That is, we do not have a situation where $g_s \ll 1$. Even in the
case of a weak coupling at all relevant scales, however, such as in QED, the formalism
outlined in section 3.1.2 and the factorization theorems are rather useful for controlling the
higher order corrections, which still might be enhanced by kinematical factors.

In the CGC higher order corrections are needed because the classical sources $\rho$ can
have large values, $|\rho| \gg 1$, but $g_s$ itself is always small. Pure perturbative calculations
are thus performed when $|\rho| < 1$, which happens in the case of "dilute" particles. In
the general treatment of factorization in QCD, or in generic field theories, however, large
contributions arise from surfaces in the multi-dimensional space of momentum integrals
where the integration contours are forced to go close to the singularities of the propagators,
the pinch singular surfaces. In QCD the momentum lines in the PSSs have large couplings.
This is the reason why factorization must be proven to all orders, and it is then convenient
to employ the power counting analysis of the PSSs. Corrections are then guaranteed to be
power suppressed in the large scale $Q$. In the case of small-$x$ therefore, ideally we would
want to formulate factorization ("$k_\perp$-factorization") up to power suppressed corrections.

Of course the treatment of factorization cannot be purely perturbative for the reasons
just explained. It is important to emphasize that the power counting methods of section
3.1.2 rely generically on dimensional analysis and Lorentz invariance, and thus not
exclusively on perturbation theory. The explicit calculations are of course performed using
Feynman graphs, but the structures obtained have a meaning beyond strict perturbation
theory. One can therefore apply the same methods to the small-$x$ region where any hard
scale might be absent.

3.3.3 The LLA and basic logic of factorization 

As the LLA is important for the formulation of factorization in the CGC, we briefly outline
the logic behind it. An all order result can be obtained by calculating the one-loop graphs
using the eikonal approximation, and then exponentiating the result. If the one-loop result
for a certain process is $\Gamma_1$, and the tree level result is $\Gamma_0$, then usually one finds

\Gamma_1 = g_s^2 \int_0^Y dy\, K_s(y) \cdot \Gamma_0, (3.50)

where dy = ^^ and the limits on y are determined by the kinematics of the given process. 
The kernel $K_s$ is found by applying the approximations appropriate for a soft term. We
can then write the complete result up to one loop as 

\Gamma_0 + \Gamma_1 = \left( 1 + g_s^2 \int_0^Y dy\, K_s(y) \right) \Gamma_0. (3.51)

For an infinitesimal change in the scale we can write this as

\Gamma_{dY} = \left( 1 + g_s^2\, dY\, K_s(dY) \right) \Gamma_0, (3.52)

so that

\frac{\Gamma_{dY} - \Gamma_0}{dY} = g_s^2 K_s \Gamma_0. (3.53)

This gives the all order LLA result

\Gamma^{\rm LLA}_Y = \exp\left( g_s^2 \int_0^Y dy\, K_s(y) \right) \Gamma_0. (3.54)

Figure 20: Particle production by classical sources in the CGC. The crosses denote the classical
field insertions.
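
The passage from the one-loop relation (3.50) to the exponentiated form (3.54) can be
spelled out as follows. This is a sketch under the simplifying assumption that $K_s$ is an
ordinary commuting function of $y$, so that no ordering prescription is needed; the notation
$\Gamma(Y)$ for the result evolved up to rapidity $Y$ is introduced here only for illustration.

```latex
% Assumption: K_s(y) is a commuting c-number function; \Gamma(Y) is illustrative notation.
\begin{align*}
\frac{d\Gamma(Y)}{dY} &= g_s^2\, K_s(Y)\, \Gamma(Y), \qquad \Gamma(0) = \Gamma_0
  \quad\Longrightarrow\quad
  \Gamma(Y) = \exp\!\left( g_s^2 \int_0^Y dy\, K_s(y) \right) \Gamma_0 , \\
\Gamma(Y) &= \Gamma_0 + g_s^2 \int_0^Y dy\, K_s(y)\, \Gamma_0 + \mathcal{O}(g_s^4)
  = \Gamma_0 + \Gamma_1 + \mathcal{O}(g_s^4), \\
K_s \text{ constant}:\quad
\Gamma(Y) &= e^{g_s^2 K_s Y}\, \Gamma_0
  = \sum_{n=0}^{\infty} \frac{(g_s^2 K_s Y)^n}{n!}\, \Gamma_0 .
\end{align*}
```

In the constant-kernel case every power of $g_s^2$ is accompanied by exactly one power of $Y$,
which is what the leading logarithmic approximation retains; terms with fewer powers of $Y$
per power of $g_s^2$ are not captured by the exponentiation.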



A similar construction is used in the CGC [17,18,22-25]. The idea is to start with
a formula at the classical level, where the correlator of the classical fields is calculated
using (3.47), and then to perform a one-loop calculation as in (3.50) and show that at this
level the classical structure (3.47) still holds. The resulting one-loop formula can then be
resummed as in (3.52) and (3.54) to obtain a final formula in the LLA.

Take for example the single inclusive particle production in the scattering of two
hadrons, described by sources $\rho_1$ and $\rho_2$, which is studied in [17,18,22-25]. The basic
classical formula, which is equivalent to a tree level calculation, is given by



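E_p \frac{dN}{d^{3}p} = \frac{1}{2(2\pi)^3} \sum_{\lambda} \left| \mathcal{M}_\lambda(p) \right|^2 (3.55)

where

\mathcal{M}_\lambda(p) = p^2 A^{\mu}_{\rm cl}(p)\, \epsilon^{(\lambda)}_{\mu}(p). (3.56)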



We illustrate this in figure 20, where the crosses denote the insertions of the classical fields
$A^{\mu}_{\rm cl}(p)$. Note that $A^{\mu}_{\rm cl}(p)$ is a function of both $\rho_1$ and $\rho_2$, so it contains the effects of both
hadrons. At the pure classical level, one evaluates (3.55) using (3.47). This gives



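\langle A_\mu A^\mu \rangle_0 = \int D\rho_1\, D\rho_2\; W_{\Lambda^+}[\rho_1]\, W_{\Lambda^-}[\rho_2]\; (A^{\rm cl}_\mu A^{\mu}_{\rm cl})[\rho_1, \rho_2] (3.57)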



where the subscript on the left hand side denotes that this corresponds to the tree level
calculation. The weight functionals can at this level be fully parametrized using the MV
model, from which an explicit result can be obtained for (3.55).

The one-loop correction to the tree level result is then found to be [24, 25]



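\langle A_\mu A^\mu \rangle_1 = \int D\rho_1\, D\rho_2\; W_{\Lambda^+}[\rho_1]\, W_{\Lambda^-}[\rho_2] \left[ \ln\frac{\Lambda^+}{p^+}\, H_1 + \ln\frac{\Lambda^-}{p^-}\, H_2 \right] (A^{\rm cl}_\mu A^{\mu}_{\rm cl})[\rho_1, \rho_2] (3.58)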



where each $H_i$ corresponds to the "JIMWLK Hamiltonian". Each $H_i$ is a Hermitian
differential kernel [39] (in the sense of functional differentiation) that acts on the classical
fields $A^{\rm cl}_\mu A^{\mu}_{\rm cl}$ in (3.58). We see that this result is analogous to (3.50). To understand the
logarithmic factors in (3.58), note that if $K_s(y)$ is independent of $y$ (which it nearly always
is), then the integral in (3.50) simply gives $Y \cdot K_s$. The rapidity $Y$ exactly corresponds to
the logarithmic factors in (3.58) and we see that $K_s$ corresponds$^{*}$ to $H_i$.

$^{*}$The JIMWLK Hamiltonian is of order $g_s^2$ in the quantum fluctuations, but it contains the
classical sources $g_s\rho$ to all orders.

Using that the $H_i$ are Hermitian one can then rewrite the complete one-loop result (3.58) as [24,25]



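\langle A_\mu A^\mu \rangle_1 + \langle A_\mu A^\mu \rangle_0 = \int D\rho_1\, D\rho_2 \left[ \left( 1 + \ln\frac{\Lambda^+}{p^+}\, H_1 \right) W_{\Lambda^+}[\rho_1] \right] \left[ \left( 1 + \ln\frac{\Lambda^-}{p^-}\, H_2 \right) W_{\Lambda^-}[\rho_2] \right] A^{\rm cl}_\mu A^{\mu}_{\rm cl}
= \int D\rho_1\, D\rho_2\; W_{p^+}[\rho_1]\, W_{p^-}[\rho_2]\; (A^{\rm cl}_\mu A^{\mu}_{\rm cl})[\rho_1, \rho_2] (3.59)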



In this rewriting one uses that formally the term containing the product $H_1 H_2$ in (3.59) is
of higher order (it is not of LLA) and thus neglected. One then gets exactly as in (3.54)
the LLA result

W_{dY} = (1 + dY\, H)\, W_0 \;\to\; W^{\rm LLA}_Y = \exp\left( \int_0^Y dy\, H(y) \right) W_0. (3.60)

Equation (3.59) is referred to as the "high energy factorization", or "JIMWLK factorization",
formula [17,18,22-25].
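
The rewriting that leads to (3.59) rests on two simple steps, sketched here. The shorthand
$L_\pm$ for the two logarithmic factors is introduced only for this illustration, and the
second line is the statement of Hermiticity under the functional integral, written for a
generic weight $W$ and functional $O$.

```latex
% Shorthand (assumed for illustration): L_+ and L_- denote the two logarithmic factors
% multiplying H_1 and H_2; H_1 acts only on \rho_1 and H_2 only on \rho_2.
\begin{align*}
\big(1 + L_+ H_1\big)\big(1 + L_- H_2\big)
  &= 1 + L_+ H_1 + L_- H_2 + L_+ L_-\, H_1 H_2 , \\
\int D\rho\; W[\rho]\, \big(H\, O\big)[\rho]
  &= \int D\rho\; \big(H\, W\big)[\rho]\; O[\rho]
  \qquad (H \text{ Hermitian}) .
\end{align*}
```

The first three terms on the right of the first line reproduce the tree-level plus one-loop
result, while the last term carries two logarithms together with two powers of the
Hamiltonian and is the one dropped as lying beyond the one-loop accuracy of the derivation.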

3.3.4 Comparison to TMD factorization 

In its derivation, (3.59) is rather different from the TMD factorization described in section
3.1.5. For example, in (3.59) there is a factorized product of the classical weight functionals
$W[\rho_i]$ rather than a product of parton distributions and/or fragmentation functions.



Equation (3.59) is in the literature implied to be a generalization of ordinary TMD
factorization. In section 5 of reference [24] we can for example read that
"JIMWLK factorization proven here is far more general and robust in comparison to the
$k_\perp$-factorization often discussed in the literature."

The statement on the wider generality of the CGC formula is motivated by the
observation that one can for "dilute" systems obtain from (3.59) a formula which looks like a
$k_\perp$-factorized formula. Since this "dilute" limit involves a simplified approximation within
the CGC formalism, it is therefore said that (3.59) is more general. For example, for the
single inclusive gluon production using (3.55) and (3.59) one gets in the "dilute" limit a
formula that looks like equation (4.8) below, which is the $k_\perp$-factorization formula
canonically used in the small-$x$ region. Moreover, within the CGC, the TMD gluon distribution
can be calculated explicitly if $W[\rho]$ is given. For example, the WW gluon distribution can
be calculated from (2.5) once $W[\rho]$ is specified. The converse statement on the other hand
is not true: it is not enough to have an explicit formula for (2.5) in order to extract $W[\rho]$
uniquely.



In this sense, it can indeed be said that (3.59) is more general than the TMD
factorization. However, from a different perspective we find that this statement is misleading
and not correct. Moreover, as we shall explain now, the factorization explained in section
3.1 is actually more robust.

Namely, equation (3.59) is only derived at one-loop order using the logic of the LLA,
while the TMD factorization is much more general and accurate than that. The LLA result
for example gives no hint at all as to what the higher order corrections might look like.
There are even instances where it gives the wrong result, even qualitatively, an example
being the Drell-Yan cross section at zero transverse momentum where the LLA gives a
vanishing result while the true result that can be obtained from the factorization approach
is non-zero [76]. Contrary to the LLA, in the factorization approach the higher order
corrections are well controlled, and even if the explicit calculations of the higher order
corrections can be difficult in practice, one can nevertheless make reliable estimates of
their importance [1]. It is therefore not correct to say that the "JIMWLK factorization" is
more robust than the TMD factorization. In fact the opposite is clearly true with regards
to the accuracy of the derivation.

Moreover, when in the CGC the dilute limit is taken, the TMD gluon distribution that 
appears in the factorization formula is given by [18,24,39] 



/(x,fcx;C)|dii„, = 7:^(/9(fc)/>(-A:))w ^=(i^+'(A;)F+*(-fc)) 



w. 



dilute ip. \f"VVKV 'V/W^p+ \- V"!^ \ '" J I VV ^p+ 

d^x d^y e'''^''^''~-y"^-'''^^''^'y^\F+\x)F+\y))w^p+- (3.61) 

The subscripts on the correlators imply that the averages using $W[\rho]$ are performed at
the scale $xP^+$. Acting with the dilute limit of the JIMWLK Hamiltonian on the classical
sources in (3.61) one then recovers the BFKL equation for the object $f(x, k_\perp; \zeta)$ (for a
simple demonstration of this, see [39]). Thus the BFKL formalism can be identified with
the dilute limit of the JIMWLK formalism. Since for example the CCH formalism [65,66]
is based upon BFKL, it is indeed correct that (3.59) presents a generalization of the work in
[65,66]. Moreover, as the work in [65,66] is frequently referred to as the "$k_\perp$-factorization"
formula, in this sense (i.e. if "$k_\perp$-factorization" is understood to refer to [65,66]) (3.59) is
more general than "$k_\perp$-factorization". The CCH formalism is, however, also based on the
LLA, and neglected terms are therefore not power-suppressed.

The argument for factorization in [65,66] is based on the use of the light-cone gauge (in 
DIS) or axial gauge (in hadron-hadron collisions). The final expression in (3.61) actually 



equals the earlier light-cone gauge expression in (p^). A similar definition also appears in 
the factorization approach as we discussed in reference to figure ^. It is, however, important 
to realize that (3.61) is supposed to hold in the dilute limit for any gauge, even a covariant
gauge. This is in fact in line with the power counting we discussed in section 3.3.2 above,
where the definition of the dilute limit is that $g_s\rho \ll 1$. This is of course why equation
(3.61) is second order only in $\rho$ (the first order term $\langle\rho\rangle$ vanishes when, as usual, the
distribution $W[\rho]$ is taken to be a Gaussian).

It is then important, however, to realize that the distribution thus obtained in (3.61)
is not the TMD gluon distribution in the TMD factorization approach. One cannot in the
TMD factorization in covariant gauge simply drop the Wilson lines because, as mentioned
above, the soft gluons exchanged between different regions have strong coupling. The TMD
factorization therefore does not correspond to the dilute limit of the CGC. The factorization
(3.59) does indeed represent a different structure than the TMD factorization, but it cannot
be said to be more general since it contains only a one-loop calculation while the TMD
factorization is valid to leading power, rather than to leading logarithm.

Figure 21: Space-time illustration of the scattering of two hadrons $H_1$ and $H_2$. In the classical
solutions of CGC, one has independent solutions in the non-causally-connected regions $R_1$ and $R_2$.
The solutions in the forward light-cone region $R_+$ are however non-trivial and give rise to the
so-called Glasma [79].

We want to emphasize that this point is important and not merely a technical detail.
The reason is that if we wish to establish factorization for a given process, then a possible
breakdown of factorization may not show up until higher order corrections are considered,
beyond the dilute limit. In section 4 below we shall discuss this point in the context of
the small-$x$ single inclusive gluon production formula. As we explain there for example,
the factorization breaking graphs studied in [77, 78] do not show up until one considers 
two gluon corrections to the parton model graphs, see figures p3 and 36. In terms of 



Feynman diagrams, the parton model graphs themselves are already at two loop order, 
so the factorization breaking does not appear until 4 loop graphs. In the dilute limit 
considered above, or in the logic of the LLA, however, this would have been completely 
missed. 

It is therefore difficult to discuss the validity of factorization at one-loop order, or in
a "dilute" approximation in the sense described in section 3.3.2. In that case for example
proton-proton collisions become rather trivial, but the real situation is far more complicated
than that, as should be obvious from our discussion in section 3.1.



3.3.5 Causality and factorization 



An argument given for the validity of (3.47) is based partly on causality (see for example
[24]), namely that two fast moving hadrons as shown in figure 21 cannot interact with
each other prior to the collision. This by itself, however, does not imply that there must
be a factorized structure for the observable under study. In covariant gauge, it is true
that the hadrons cannot interact prior to the collision, and they are therefore causally
disconnected before the collision. To write a factorization formula, however, one must be
able to factorize the soft emissions which can occur at late times after the collision. Even
though the hadrons are causally disconnected prior to the scattering, the scattering might
produce color entangled states which break factorization (see section ^^ below) . 

Moreover, the causality argument does not hold in "physical gauges", such as the
Coulomb gauge or the axial gauge, where manifest Lorentz invariance is broken and
faster-than-light propagation is possible in individual graphs. The causality violating
contributions should cancel in the final, physical results, but the proofs can be very non-trivial.
It was in fact reported early on [80, 81] that the faster-than-light interactions in the
physical gauges would correlate the hadrons prior to the collision and break factorization in
hadron-hadron collisions such as in the Drell-Yan process.

Figure 22: Examples of processes considered in the hybrid formalisms. Left: Photon production
in quark-nucleus scattering. Right: Quark production in quark-nucleus scattering.

Factorization, both collinear and TMD, in fact holds in Drell-Yan [1]. The problematic 
gluons are precisely the Glauber (Coulomb) gluons which complicate the proofs. However, 
in covariant gauge one can consistently deform the integration contours away from the 
Glauber region and restore factorization. Whether this can be done for more complicated 
interactions is of course the real question. We discuss this more in section |^ below. What is 
clear, however, is that the proof of factorization is much more intricate than what general 
causality arguments would suggest. 

3.4 Hybrid formalisms 

Some of the applications of the CGC model fall into a category that we shall call the
"hybrid formalisms", since they combine the CGC treatment above with that of collinear
hard scattering factorization (see e.g. [55-57,82-84]). These formalisms are used especially
in proton-nucleus ($pA$) collisions. Typical examples include photon production, Drell-Yan,
and soft particle production in the forward region (all in $pA$ collisions). As we shall show
here, however, these formalisms do not address the question whether there is factorization
for the given process, and the validity of the proposal to mix collinear factorization with
the CGC treatment is not at all clear to us.



We illustrate in figure 22 two examples of the processes considered in this framework. 



The upper incoming line refers to a quark of momentum p while the lower thick line with 
momentum P refers to the nucleus. The proton is therefore not treated explicitly. Only 
interactions between the active quark and the nucleus are considered as indicated in the 
figure. The gluon attachments between the lower and the upper blobs are described by a
Wilson line exactly as in (^.9|). 

Consider the quark production case. The incoming quark is here on-shell and has zero
transverse momentum [56,83]. Thus the transverse momentum of the "observed" final state
quark is determined by the momentum transferred from the nucleus. This dependence is
then given directly by the Fourier transform of the Wilson line,

W(k_\perp) = \int d^{2}x_\perp\, e^{-i k_\perp \cdot x_\perp}\, W(x_\perp). (3.62)

There are two possibilities: either the observed particle has low transverse momentum,
of the order of the typical intrinsic transverse momentum, i.e. $k_\perp \sim m$, or it has large
transverse momentum, of the order of a hard scale $Q$. The cases in figure 22 suggest that
the particle is produced at low transverse momentum, since the $k_\perp$ dependence is directly
determined by (3.62). In that case, however, there is no reason to neglect the transverse
momentum from the proton side, as this could completely change the kinematics of the
observed final state particle. One must therefore formulate a TMD factorization formula,
with the TMD parton distribution and fragmentation function of the proton taken into
account. If on the other hand the produced particle has large transverse momentum, then
a hard region must properly be included in the process. This, however, is not the case in
figure 22.

The central idea of the hybrid formalisms is based on what is called the "factorization
of mass singularities". Here an emphasis is put on the mass divergences that appear
in massless on-shell partonic reactions [85]. This procedure is in fact widely found in
the literature when dealing with collinear factorization. Despite its wide use, however,
it is a physically misleading procedure. It is in fact a rather different approach from
the factorization explained in section 3.1 above. In this approach it is first asserted that a
hadronic cross section $\sigma_h$, or a structure function $W_h$, is a convolution of the corresponding
partonic cross section $\sigma_p$, or structure function $W_p$, and a so-called "bare parton density",
$f^{\rm bare}$,

$$W_h(q,P) = W_p(q,\xi P) \otimes f^{\rm bare}(\xi). \tag{3.63}$$



The convolution in the variable $\xi$ is here the same as in equations ( 3.1^ ) and (3.19). In
the appendix of [83] (see also [86]) it is for example asserted for single inclusive hadron
production that the differential cross section is given by

$$d\sigma_h(p,p_h;P) = f^{\rm bare}(\xi) \otimes D^{\rm bare}(z) \otimes d\sigma_p(z,\xi;P) \tag{3.64}$$

where $p$, $p_h$ and $P$ are the momenta of the incoming proton, the produced hadron and the
incoming nucleus respectively. For the forward particle production shown in figure 22 (right
graph), the incoming quark has momentum $\xi p$ while the outgoing quark has momentum
$p_h/z$ and it subsequently fragments to produce the observed hadron $p_h$.

In both cases, the calculations are then performed with massless partons and with the
parton entering the scattering taken to be on-shell with zero transverse momentum. With
these assumptions, collinear divergences appear in the partonic cross sections. It has been
shown in the case of (3.63) that the result for $W_p$ can be written as a convolution of a
divergent factor, $D$ (not to be confused with the fragmentation function), and a finite cross
section $\sigma$ [87,88]. Using the associativity of the convolution operation $\otimes$, one can then
write

$$W_h = (\sigma \otimes D) \otimes f^{\rm bare} = \sigma \otimes (D \otimes f^{\rm bare}) = \sigma \otimes f^{\rm ren} \tag{3.65}$$

where the "renormalized" parton distribution is given by $f^{\rm ren} = D \otimes f^{\rm bare}$. This final result
is actually just like that in (3.23). The procedure just outlined is, however, problematic
for several reasons.

To begin with, there is no proof for the assertion (3.63) or (3.64), which actually is the
statement of factorization. In the hybrid formalisms, it is simply stated that the proton
side can be treated by integrated distributions. It is also in this case not exactly clear what
the "bare parton density" is. According to the setup of the formalism, it is supposed to
represent a distribution of on-shell and massless partons in the proton. This, however, is
physically an ill-defined concept since quarks and gluons never exist as on-shell particles
inside real hadrons. Moreover, if quark masses are retained in the calculations, there are no
collinear divergences. It is therefore dangerous to emphasize the importance of the mass
divergences since they appear only due to the approximation of using massless on-shell
partons, and are therefore of a spurious nature. The "regularization" procedure just above
is therefore conceptually different from (3.23), and crucially, it is not in any way related to
factorization even if this might seem to be implied.



In the analysis of section 3.1, what factorization means is that a given cross section or
structure function can be written in a factorized form where each factor is associated with
a given momentum region. For example, in the case of DIS it means that we can factorize
the hadronic tensor as



$$W^{\mu\nu} = C_j^{(0)}(Q/\mu,\, x/\xi,\, \epsilon) \otimes f_j^{(0)}(\xi,\, \mu,\, \epsilon) \tag{3.66}$$

up to power-suppressed corrections. We can also write this simply as 



$$W = C_j^{(0)} \otimes f_j^{(0)}. \tag{3.67}$$



The meaning of the bare parton distribution is then that it is the gauge invariant integrated
or TMD parton distribution constructed out of the bare fields of the Lagrangian. An
example is the light-cone gauge definition of the bare integrated gluon distribution in
(3.22). In fact any gauge invariant definition of a parton distribution involving suitable
Wilson lines, as for example in the WW distribution ( ^.^ ) or the dipole distribution (2.10),
must refer to the bare distribution, because the gauge transformation properties are obeyed
by the gauge links constructed out of the bare fields. So strictly speaking we should have
denoted all those distributions as in (3.22) and (3.66), i.e. by a superscript, $f^{(0)}$. It is
important, however, to realize that this bare distribution, constructed out of the bare
fields, cannot be the same as the undefined quantities in (3.63) and (3.64). For it is clear
that it does not represent any distribution of on-shell, massless partons as is implied by
(3.63) and (3.64). Once factorization has been proved as in (3.66) (or in (3.37), (3.38) and
(3.40)), which itself is a very non-trivial statement, then renormalization is a matter of
removing UV divergences by a suitable redefinition of the parameters of the Lagrangian.
Order by order in perturbation theory this means adding the necessary counterterms from
the Lagrangian, for example in the MS scheme. One then finds the renormalized parton
distribution via a formula as in (3.23). For (3.66) we find that
$$W = \cdots = C_j \otimes f_j, \tag{3.68}$$



where $f_j$ is the renormalized distribution given by (3.23), and the Kronecker delta in the
first line also includes delta functions with respect to the momentum convolutions. This
procedure still applies if the quark masses are retained, in which case there are no collinear
divergences at all.

Now, in the factorization approach, one can indeed approximate the momentum entering
the hard scattering factor as massless and on-shell. It is crucial, however, that the
hard scattering factor, $C$ in (3.68), is defined with suitable subtractions (as we indicated in
(3.21) and showed in figures ^ and ^) so that it genuinely describes a wide angle scattering
with scale $Q$ (we also note that the UV divergences of the subtraction terms are regulated
by the $Z$ factor in (3.68)). In the TMD factorization in section 3.1.5 for example, the error from
neglecting the transverse momenta, $q_\perp$, in the hard factor goes as $q_\perp/Q$, which indeed is small
in the validity region of the formalism. In (3.63) and (3.64), however, this is no longer the
case (in particular in (3.64) the partonic part still contains the scattering off the nucleus).
Moreover for particle production at low transverse momentum, the neglected transverse
momentum, from the proton side, is of the same order as the transverse momentum of the
final state particle, which means that the error is substantial.

What is also non-trivial is that TMD factorization is mixed into the formalism of the
factorization of mass singularities. If in fact we want to treat the given problem using
TMD distributions, then in the small-x case where the produced particle is typically soft,
one must consider off-shell matrix elements, precisely for the reason just explained
above. The off-shell matrix elements must then carefully be specified, to ensure gauge
invariance (or rather gauge independence), and one cannot use on-shell incoming partons.
For the lowest order contributions, gauge independent off-shell scattering coefficients have
been calculated in the CCH approach [65,66], and an explicit all-order definition in the
case of BFKL is given in [89]. See also [90-93] for more recent considerations.

To summarize this section, the hybrid formalisms do not really address the question
of factorization. Factorization is in a sense assumed from the start, via equation (3.63)
or (3.64). In fact the real problem is to show a factorization like in (3.66) to start with.
Moreover, the procedure which is referred to as the renormalization of the parton densities
is conceptually very different from what is the case in the hard scattering factorization. It is
moreover physically a misleading procedure since the basic structures are not well-defined.
Additionally we have seen that for particles produced at low transverse momentum, TMD
distributions must be used also from the proton side, but then of course one must first
formulate a valid TMD factorization formula, which might not be possible. We will
in the coming sections analyze single particle production in the small-x region.

Figure 23: Production of soft hadrons in the small-x limit. The observed hadron p is associated
with the soft region.

4. The fundamentals of single inclusive particle production 

We will now give a comprehensive analysis of single inclusive particle production in high 
energy QCD, explaining many details which are usually overlooked. We will start by going 
through the basics of particle production, giving an overview of the leading regions in 
different kinematical situations. We then go on to analyze single inclusive gluon production 
in hadron-hadron scattering which is a process that has been widely studied (see e.g. 
[16-20,22-31,94-96] and references therein) in the small-x region. We will first go through 
the process using the axial gauge which is essentially the gauge on which the arguments for 
factorization are based, for example in [33,94-96]. We will in detail explain the technical 
difficulties of the axial gauge, and why after all it is not convenient for proving factorization. 
We will then discuss hadron production from a more complete point of view, by building 
upon the analysis of the leading regions for the different kinematical cases. Finally we 
shall address the exact form of the TMD gluon distribution associated with this process, 
finishing with a discussion of the validity of factorization. 



4.1 The different cases of particle production 

In figures 23, 24 and 25 we list the possible scenarios for single inclusive particle production
at small-x. In this section we explain the physics of the different cases.

Figure 23 represents a typical scenario of particle production in the Regge region,
namely that of a soft particle produced at a typical small angle scattering event. In this
case there is no hard region. All virtualities are of the typical soft scale $m^2$. The momentum
$p$ of the produced particle therefore typically scales as $|p^\mu| \sim m$. This case is relevant for
soft particle production at mid-rapidity. The inclusive charged particle spectrum at
mid-rapidity,

$$\left.\frac{dN_{ch}}{d\eta}\right|_{\eta=0} \tag{4.1}$$

has been measured by the different experimental groups at the LHC: ATLAS [97], CMS
and ALICE [99,100]. This also happens to be the most studied case in the applications
of small-x physics [16,20,26-31,101].

Figure 24: Production of hadrons in the fragmentation region of particle B in the small-x limit.
The observed hadron p has rapidity close to that of B. A similar graph exists for production in the
opposite direction close to A. These cases require the use of fracture functions rather than ordinary
parton distributions and fragmentation functions.



Next, in figure 24 we show particle production in the case where the produced particle
is close in rapidity to one of the hadron beams. This case therefore covers the forward
production of particles. At the LHC, the CMS detector can detect particles in the
pseudorapidity range $|\eta| < 5$ thanks to the hadronic forward calorimeters. Since the particles
traveling in the forward region have enormous longitudinal momentum, they must of course
have high $p_\perp$ as well, since otherwise they would have too large rapidity and escape
detection via the beam pipes. In CMS for example [102] forward jets (not hadrons) in the
rapidity range $3.2 < |\eta| < 4.7$ have $p_\perp > 35$ GeV. One can also arrange for events where a
hard di-jet is produced at central rapidity, to accompany the forward jet. The correlations
between the forward jet and the central jets then offer important insight into the parton
kinematics, see e.g. [92,103,104]. Actually if the momentum of the produced hadron
belongs to either $C_A$ or $C_B$, then one has to use so-called fracture functions rather than
ordinary fragmentation functions or parton distributions.

Finally in figure 25 we show the case where the hadron is produced with large rapidity
separation to both beams (for example in the central region) and where a hard region is
present. This could for example be the case where the components $p^\mu$ are typically of
order $Q \gg m$ or where we are looking at an event where a hard collision is present, that is
jets with large $p_\perp$ are produced in addition to the particle we tag (we do not show these
additional jets in figure 25). The region decomposition here needs some explanation.

Figure 25: Production of hadron in the presence of a hard factor. The soft region coupling the
collinear regions is also present, and additional collinear factors emerging from the hard scattering
may be present as well, but for simplicity we do not show these here.



In section 3.1.2 we classified momenta according to different possible scalings. The
external scales in that case were set by $Q$ which also happens to be the hard momentum
scale in the process. Therefore such a classification is appropriate when the components
of the hard momenta scale with the longitudinal momenta of the external particles. The
decomposition is thus appropriate when x is not too small. In that case we noticed that
the only real possibilities for a pinch of a given momentum $k^\mu$ were as follows:

• None of $k^\mu$ scales with $Q$. Then we can characterize $k^\mu$ by the typical soft scale $m$,
i.e. $k^\mu \sim m$. Then $k \in S$.

• A longitudinal component, say $k^+$ or $k^-$, scales with $Q$. Then we have $k^+ \sim Q$,
$k^- \sim m^2/Q$, $k_\perp \sim m$ and vice versa. In this case $k \in C_A$ (or $k \in C_B$ in the opposite
case).

• $k_\perp \sim Q$, in which case also $k^+k^- \sim Q^2$. Thus $k^\mu \sim Q$ and in that case $k \in H$.
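The three scalings listed above can be turned into a small bookkeeping routine. The following is only a sketch under stated assumptions: the function name `classify_region`, the tolerance factor `tol` that decides what counts as "of order", and the label strings are all hypothetical choices made for illustration, and real power counting is of course a statement about asymptotic scaling rather than about numerical windows.

```python
def classify_region(k_plus, k_minus, k_perp, Q, m, tol=3.0):
    """Label a momentum (k+, k-, k_perp) as S, C_A, C_B or H.

    "x is of order scale" is modelled (assumption) as
    scale/tol <= |x| <= scale*tol.
    """
    def is_order(x, scale):
        return scale / tol <= abs(x) <= scale * tol

    # Hard region: transverse momentum and k+ k- both set by Q.
    if is_order(k_perp, Q) and is_order(k_plus * k_minus, Q * Q):
        return "H"
    # Collinear to A: k+ ~ Q, k- ~ m^2/Q, k_perp ~ m.
    if is_order(k_plus, Q) and is_order(k_minus, m * m / Q) and is_order(k_perp, m):
        return "C_A"
    # Collinear to B: the same with plus and minus exchanged.
    if is_order(k_minus, Q) and is_order(k_plus, m * m / Q) and is_order(k_perp, m):
        return "C_B"
    # Soft: every component of order m.
    if all(is_order(c, m) for c in (k_plus, k_minus, k_perp)):
        return "S"
    return "unclassified"


if __name__ == "__main__":
    Q, m = 100.0, 1.0
    for k in [(100.0, 0.01, 1.0), (0.01, 100.0, 1.0), (1.0, 1.0, 1.0), (100.0, 100.0, 100.0)]:
        print(k, classify_region(*k, Q=Q, m=m))
```

Momenta that fall between the windows are returned as "unclassified", which is a reminder that the three cases above are the possible pinch configurations and not a partition of all of momentum space.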



Using this classification we then saw that a power counting analysis gives that at
leading power, $C_A$ and $C_B$ can be connected to $S$ via arbitrarily many soft longitudinally
polarized gluons, while again arbitrarily many collinear gluons can be exchanged between
$H$ and the respective collinear region.

In the small-x case we have a different situation. In this case the large components of
the external particles scale with $\sqrt{s}$ but the momentum transfer remains fixed as $\sqrt{s} \to \infty$.
Thus in this case there is no region in which all momentum components scale with the
asymptotic parameter $\sqrt{s}$. In the soft production case one has the possibilities that

• None of $k^\mu$ scales with $\sqrt{s}$. Then generally $k \in S$.

• $k^+$ or $k^-$ scales with $\sqrt{s}$. In this case $k \in C_A$ or $k \in C_B$ respectively.

Figure 26: Coupling of two gluons from $C_A$ to $H$.



There may, however, also be present hard collisions which give rise to jets or hadrons of
several tens of GeV. Thus we may very well have regions where $k^\mu \sim Q$. We then propose
the following classification:

• If $k^+$ or $k^-$ scales with $\sqrt{s}$, then just as above we let $k \in C_A$ or $k \in C_B$ respectively.

• Let $|k^\mu|/\sqrt{s} \to 0$ as $\sqrt{s} \to \infty$, but such that for example $|k^+|/|k^-| \gg 1$ and
$|k^+|/|k_\perp| \gg 1$. Then even though $k^+ \ll \sqrt{s}$, we shall let $k \in C_A$. In the opposite case
we of course let $k \in C_B$. To characterize such cases we shall let $k^+ \sim Q \ll \sqrt{s}$ (or
$k^- \sim Q \ll \sqrt{s}$) where $Q \gg m$.

• We define the region where $k^+k^- \sim Q^2$ to be the hard region. Thus in figure 25 there
is momentum $k^- \sim Q$ flowing into $H$ from $C_B$, and momentum $k^+ \sim Q$ flowing in
from $C_A$. The momenta going out from $H$ to the final state are then characterized by
the scale $Q$.

• Momenta such that $|k^\mu| \sim m \ll Q$ are as before classified as soft. In figure 25 we do
not explicitly draw the soft subgraph to keep the notation simple.



With this classification we can then understand figure 25. Notice that the momentum
lines whose large components scale with $\sqrt{s}$, and therefore belong to one of the collinear
regions, cannot join the collinear region to the hard region $H$, since in that case a large
momentum $\sqrt{s}$ would be transferred to $H$, and we would no longer be in the small-x region.
Thus in figure 25 the lines joining $C_{A,B}$ to $H$ belong to the second class above. This is a
different situation than in section 3.1.2 where any line in $C_{A,B}$ can join that region to $H$.

We shall now argue that the power counting is essentially the same as in section 3.1.2,
despite the somewhat different kinematics. In figure 26 we show an example where two
gluons from $C_A$ couple to $H$ as defined above. These two gluons have $k^+ \sim Q$, and in the
lower end (not shown in the figure) they couple to collinear-to-A gluons which may have
momenta scaling as $\sqrt{s}$ in the plus direction. The leading contribution is then given by



$$\int d^4k_1 \cdots \frac{\cdots}{(p+k_1)^2\,(p+k_1-k)^2}\,\cdots \tag{4.2}$$

where $p \in H$. We then write this expression as

$$\int d^4k_1\, d^4k\; \frac{\cdots}{2p^-k_1^+\;\, 2p^-(k_1^+ - k^+)}\,\cdots\, A^{++}(k_1,k,p_A). \tag{4.3}$$
Now, as in section 3.1.2 we characterize the momentum coupling $C_A$ to $H$ by a scale $\lambda_A$,
such that $k^- \sim \lambda_A^2/Q$ and $k_\perp \sim \lambda_A$. When the momentum $k$ in figure 26 couples to
$A^{++}(k_1,k,p_A)$, there will be a typical contribution of

$$\frac{\sqrt{s}}{(p_A+k)^2} \sim \frac{\sqrt{s}}{p_A^+k^-} \sim \frac{\sqrt{s}}{\sqrt{s}\,\lambda_A^2/Q} = \frac{Q}{\lambda_A^2}. \tag{4.4}$$

The factor $\sqrt{s}$ in the numerator comes from the large boost of A in the + direction.
Remember that in the case covered in section 3.1.2 we have

$$\frac{Q}{(p_A+k)^2} \sim \frac{Q}{Q\,\lambda_A^2/Q} = \frac{Q}{\lambda_A^2}. \tag{4.5}$$



As we see (4.4) agrees with (4.5). We therefore essentially have the same situation as before,
that is, arbitrarily many longitudinally polarized gluons of the second type in the
classification above can connect the collinear regions to $H$ in figure 25. Indeed the contribution
from figure 26 gives



$$(\cdots)\left[\int d^4k_1\, \frac{\cdots}{(p+k_1)^2}\right]. \tag{4.6}$$



The term in the brackets corresponds to the contribution from gluon $k_1$ only. The factor
outside therefore gives the contribution from attaching the additional gluon $k$ and we see
that it gives a logarithmic contribution

$$\frac{d\lambda_A}{\lambda_A} \tag{4.7}$$

so that there is no power suppression for coupling the extra gluon $k$ to $H$.

To ensure the validity of all these arguments it is again important that one can perform
contour deformations out of the Glauber region. We will in the next sections give a careful
analysis of the factorization arguments that are based on the use of axial gauge, and we
will show the difficulties associated with such arguments. We will continue the general
discussion of single particle production in section 4.5 below. Before that, however, we want
in the coming sections to concentrate on the small-x single inclusive gluon production cross
section that has been widely used for phenomenological applications.

4.2 The small-x formula for gluon production 



The most basic process for gluon production is depicted in figure 27 where the idea is that
two gluons, $k_A$ and $k_B$, each belonging to one of the incoming hadrons, fuse to produce a
gluon of momentum $l$ which then emerges in the final state. The argument for the validity
of figure 27 is based on the use of axial gauge. The situation is similar to the earlier case
where the use of the light-cone gauge eliminates all higher order gluon exchanges to leading
power.

Figure 27: Single inclusive gluon production in hadron-hadron scattering according to equation
(4.8).

The factorization formula being used is given by [16,20,26,33,96]

$$\frac{d\sigma}{d^2l_\perp\, dy} = \frac{2\alpha_s}{C_F\, l_\perp^2} \int d^2k_\perp\, f_A(y, k_{A,\perp})\, f_B(Y-y, k_{B,\perp}) \tag{4.8}$$

where

$$k_A = k, \qquad k_B = l - k, \tag{4.9}$$

and $y$ is the rapidity of the produced gluon with respect to the right moving hadron $p_A$. The
functions $f_A$ and $f_B$ represent the respective TMD gluon distributions, and we will shortly write
down the definitions used. The origin of equation (4.8) goes back to the GLR papers [94-96]
where the function $f$ is "defined" as the derivative of the integrated gluon distribution
(which is called the "gluon structure function" in [94-96])

$$f(y, k_\perp) = \frac{\partial\, xG(x, k_\perp^2)}{\partial k_\perp^2}, \qquad y = \ln 1/x. \tag{4.10}$$

We note that this relation (or rather the inverted integral version of it) is a direct
application of the parton model result (3.5), although in the parton model the integral over the
unintegrated distribution is over all $k_\perp$. There are several good reasons why one should
be very cautious with the naive application of the parton model result. We will discuss
this more in [42], and see also the comments just after equation (4.94) below.



As for the validity of the factorization formula (4.8), it is common in the literature to
cite the works [33,41]. Reference [41] makes use of the dipole formalism in studying
deep inelastic scattering on a large nucleus, where the nucleus is taken to be described by
the classical MV model. In this case the "unintegrated gluon distribution" is taken to be

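$$f(k_\perp; y) = \frac{C_F}{\alpha_s (2\pi)^3} \int d^2b_\perp \int d^2r_\perp\, e^{-ik_\perp\cdot r_\perp}\, \nabla^2_{r_\perp} N_G(r_\perp, b_\perp, y), \tag{4.11}$$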

where $N_G$ has the same meaning as $N$ in ( p.6| ) but is instead written in the adjoint
representation as

$$N_G(r_\perp, b_\perp, y) = 1 - \frac{1}{N_c^2-1} \left\langle {\rm Tr}\left\{ W(b_\perp + r_\perp/2)\, W^\dagger(b_\perp - r_\perp/2) \right\} \right\rangle_y. \tag{4.12}$$

Figure 28: Poles in the plane of $k^-$ and possible integration contour.



The Wilson line $W$ has the same form as in (2.9), but with the replacement $t^a \to T^a$
where $T^a$ is the adjoint color matrix. As can be seen we have indicated the dependence
on the rapidity variable $y$ a bit differently in (4.11) than in (4.8). We have in fact done
this on purpose and it should later on be clear why we have done so. Notice for now that
(4.11) is essentially the dipole distribution (2.10), with the difference that it is here written
using Wilson lines in the adjoint representation. It is important to note, however, that
(4.11) is not directly derived from the formalism in [41]. Its form is rather asserted by the
assumption that the dipole formalism used in [41] is equivalent to the factorization formula
(4.8).

The results of [41] are in turn partly based on [33] where the light-cone gauge is
employed and it is argued that the leading regions have the structure shown in figure
27. We also note that a similar factorized formula is found in the classical DDT paper [105]
from the early days of QCD.

We will therefore now go through the light-cone gauge calculation. First, however, we 
need to specify the kinematics more carefully. 



4.2.1 The kinematics 

We denote as usual the incoming momenta by $p_A$ and $p_B$. In the cms frame, in the limit of
very high energy and neglecting the masses, one has

$$p_A = (\sqrt{s/2},\, 0,\, 0_\perp) \tag{4.13}$$
$$p_B = (0,\, \sqrt{s/2},\, 0_\perp) \tag{4.14}$$

so that $s = 2p_A \cdot p_B = 2p_A^+ p_B^-$.
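As a quick check of the light-cone kinematics in (4.13) and (4.14), the relation $s = 2p_A\cdot p_B$ can be verified numerically. The sketch below assumes the convention $a\cdot b = a^+b^- + a^-b^+ - a_\perp\cdot b_\perp$ (which is what makes $s = 2p_A^+p_B^-$ hold); the helper names `lc_dot` and `beam_momenta` and the numerical value of $\sqrt{s}$ are illustrative choices only.

```python
import math


def lc_dot(a, b):
    """Scalar product of two vectors given as (plus, minus, (x, y)).

    Assumed convention: a.b = a+ b- + a- b+ - a_perp . b_perp
    """
    (ap, am, at), (bp, bm, bt) = a, b
    return ap * bm + am * bp - (at[0] * bt[0] + at[1] * bt[1])


def beam_momenta(s):
    """Massless beam momenta p_A and p_B in the cms frame for given s."""
    half = math.sqrt(s / 2.0)
    p_a = (half, 0.0, (0.0, 0.0))  # right mover: only a plus component
    p_b = (0.0, half, (0.0, 0.0))  # left mover: only a minus component
    return p_a, p_b


if __name__ == "__main__":
    s = 7000.0 ** 2  # illustrative value, in GeV^2
    p_a, p_b = beam_momenta(s)
    print("2 pA.pB =", 2.0 * lc_dot(p_a, p_b), " s =", s)
    print("pA^2 =", lc_dot(p_a, p_a), " pB^2 =", lc_dot(p_b, p_b))
```

Both beam momenta come out light-like, as they must when the masses are neglected.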



We now ask which of the cases in section 4.1 above is relevant here for figure 27.
From figure 27 we see that there will be a typical contribution of the type

$$\frac{\text{Numerator}}{(k_A^2 + i\epsilon)(k_B^2 + i\epsilon)((p_A - k_A)^2 + i\epsilon)((p_B - k_B)^2 + i\epsilon)} \times (\text{Rest of graph}). \tag{4.15}$$
Let us now consider the $k_A$ part, and the integral over $k^-$. We note that if $k_A^+ < 0$
or $k_A^+ > p_A^+$ then the poles in the $k^-$ plane are either both in the upper or both in the lower
half plane respectively. In those cases we can deform away from the poles simultaneously
and we get a power suppressed contribution. Thus we have $0 < k^+ < p_A^+$. In this case the
pole from the $k_A$ propagator is in the lower half plane, while the pole from the lower blob
is in the upper half plane and the integration contour is therefore trapped. We show the
pole structure in figure 28. We here simply denote the order of magnitude of the poles,
setting $k_\perp \sim m$. If we denote the two poles in $k^-$ by $k_1^-$ and $k_2^-$ we see that the distance
between them satisfies

$$|k_1^- - k_2^-| \sim \frac{k_\perp^2}{2}\left(\frac{1}{k^+} + \frac{1}{p_A^+ - k^+}\right) \sim \frac{k_\perp^2}{2k^+}. \tag{4.16}$$



Thus when $k_\perp \to 0$ (and all masses in the theory are neglected) we get an exact pinch.
As $k^+ \to \sqrt{s}$ we also see that the poles are increasingly pinched and there is potentially a
large contribution (from the collinear PSS). This, however, corresponds to the non-Regge
region and is therefore not relevant for us. Now, we can let the integration contour pass
near the $p_A - k_A$ pole in which case $|k^-| \sim m^2/\sqrt{s}$ (if actually the lower blob consists of
a single spectator line then this pole becomes exact because there will be a delta function
setting the spectator line on-shell).
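The pole structure just described can be made concrete with a small numerical sketch. It assumes massless propagators $k^2 + i\epsilon = 2k^+k^- - k_\perp^2 + i\epsilon$ and $(p_A - k)^2 + i\epsilon$ with $p_A = (p_A^+, 0, 0_\perp)$; the function names `kminus_poles` and `pole_separation` and the finite value of `eps` are illustrative assumptions, not part of the formalism.

```python
def kminus_poles(k_plus, k_perp, pA_plus, eps=1e-9):
    """Poles in the complex k- plane of 1/(k^2 + i eps) and 1/((pA - k)^2 + i eps).

    Massless lines are assumed; eps > 0 is kept finite only to read off
    in which half plane each pole lies.
    """
    # 2 k+ k- - k_perp^2 + i eps = 0
    pole_k = complex(k_perp ** 2, -eps) / (2.0 * k_plus)
    # -2 (pA+ - k+) k- - k_perp^2 + i eps = 0
    pole_spectator = -complex(k_perp ** 2, -eps) / (2.0 * (pA_plus - k_plus))
    return pole_k, pole_spectator


def pole_separation(k_plus, k_perp, pA_plus):
    """Distance between the real parts of the two poles for 0 < k+ < pA+."""
    return 0.5 * k_perp ** 2 * (1.0 / k_plus + 1.0 / (pA_plus - k_plus))


if __name__ == "__main__":
    pA_plus, k_perp = 10.0, 1.0
    for k_plus in (-1.0, 2.0, 12.0):
        pole_k, pole_spec = kminus_poles(k_plus, k_perp, pA_plus)
        print(f"k+ = {k_plus:5.1f}: Im(k pole) {pole_k.imag:+.1e}, "
              f"Im(spectator pole) {pole_spec.imag:+.1e}")
```

For $0 < k^+ < p_A^+$ the two poles sit on opposite sides of the real axis, so the contour is trapped between them, while outside that interval both poles lie in the same half plane and the contour can be deformed away from them.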



We might, however, also ask what happens if there is a hard region as in figure 25.
Assume for example that $l_\perp \sim Q$. As described in section 4.1, we must then have $k^+ \sim Q$
and $(l-k)^- \sim Q$ (we now use that $k_A = k$ and $k_B = l - k$). Then

$$k^+ \sim Q, \qquad k^- \sim Q^2/\sqrt{s}, \tag{4.17}$$

and thus

$$k^+k^- \sim \frac{Q}{\sqrt{s}}\, Q^2 \ll Q^2 \sim k_\perp^2. \tag{4.18}$$

The last estimate comes from $k_\perp \sim |l_\perp - k_\perp| \sim l_\perp \sim Q$. This, however, implies $k^2 \sim Q^2$,
which means that $k$ is actually not in the collinear-to-A PSS. It instead belongs to $H$ and
one can see that this case is suppressed. To get a leading contribution we want $k^2 \sim m^2$,
and similarly $(l-k)^2 \sim m^2$, but then it is easy to see that we cannot have $l_\perp^2 \sim Q^2$.
Thus for the graph shown in figure 27 we do not have the situation in figure 25. To have
a situation with a hard region like in figure 25 we must instead consider an additional
collinear, unobserved, jet that emerges from $H$. This, however, makes the situation rather
complicated and changes the physics involved quite a bit. We shall briefly come back to
this case in the discussions in sections 4.5 and 4.6 below.



For analyzing the small-x formula (4.8) we consider the situation where $k^+ \sim m$. That
is, we essentially have the soft (or perhaps semi-hard) case in figure 23. A similar analysis
as above for the $l - k$ line implies that in this case

$$|k^-| \sim m^2/\sqrt{s}, \qquad |l^+ - k^+| \sim m^2/\sqrt{s}. \tag{4.19}$$

Therefore

$$|k| \sim (m,\, m^2/\sqrt{s},\, m) \tag{4.20}$$

so that

$$k^+ = l^+ + O(m^2/\sqrt{s}), \qquad l^- \gg |k^-|. \tag{4.21}$$

Thus

$$k^+ \gg |k^-|, \qquad (l^- - k^-) \gg |l^+ - k^+| \tag{4.22}$$

and

$$|k^+k^-| \sim m^3/\sqrt{s} \ll m^2 \sim k_\perp^2 \tag{4.23}$$

$$|(l^+ - k^+)(l^- - k^-)| \sim m^3/\sqrt{s} \ll m^2 \sim (l_\perp - k_\perp)^2. \tag{4.24}$$

Both gluons $k$ and $l - k$ are therefore in the Glauber region where the transverse momentum
components dominate. In light of what we have said earlier it would seem that we had better
avoid the Glauber region. Note, however, that there is no Glauber pinch here so we can
deform out of the Glauber region if necessary.
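To get a feeling for the numbers, the scalings above can be checked in a few lines. This is only an illustration: the Glauber condition is modelled, as an assumption, by requiring $|k^+k^-|$ to be smaller than a fixed fraction `ratio` of $k_\perp^2$, and the values chosen for $m$ and $\sqrt{s}$ are arbitrary.

```python
def is_glauber(k_plus, k_minus, k_perp, ratio=0.1):
    """True if |k+ k-| << k_perp^2, modelled here as |k+ k-| < ratio * k_perp^2."""
    return abs(k_plus * k_minus) < ratio * k_perp ** 2


if __name__ == "__main__":
    m, sqrt_s = 1.0, 7000.0  # illustrative values in GeV

    k = (m, m * m / sqrt_s, m)            # scaling of k in the soft case
    l_minus_k = (m * m / sqrt_s, m, m)    # scaling of l - k
    collinear = (100.0, 0.01, 1.0)        # k+ k- ~ k_perp^2: not Glauber

    for name, mom in (("k", k), ("l - k", l_minus_k), ("collinear", collinear)):
        print(f"{name:10s} |k+k-| = {abs(mom[0] * mom[1]):.2e}, "
              f"k_perp^2 = {mom[2] ** 2:.2e}, Glauber: {is_glauber(*mom)}")
```

With these values the product of the longitudinal components is suppressed by $m/\sqrt{s}$ relative to $k_\perp^2$ for both exchanged gluons, whereas a genuinely collinear momentum has $k^+k^-$ of the same order as $k_\perp^2$.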

4.3 The use of the light-cone gauge

The main argument for the validity of (4.8) given in [33] is based on the use of the
light-cone gauge. Since an axial gauge is also used in [96] to argue for the validity of (4.8), we
now go through the derivation in these gauges. We shall start with the light-cone gauge
in this section and then in the next section give an account based on the non-light-like axial
gauge. We notice that axial or light-cone gauge is also used in establishing the factorization
formulas in the CCH [66] and CCFM [70] formalisms.

There is in fact a problem with the kinematical arguments given above in the light-cone
gauge. If we choose the gauge $A^+ = 0$ then the treatment of the A part is as we just
described. However, we do get a problem with the treatment of the B side. Similarly we
get a problem with the treatment of the A side if we work in $A^- = 0$ gauge. In fact the latter
is the gauge on which the arguments in [33] are based. What we want to demonstrate in
this section is that the light-cone gauge is clearly improper for the treatment of
hadron-hadron collisions (be it proton-proton, proton-nucleus or nucleus-nucleus collisions). We
will offer several reasons for this, and we return to the just mentioned issue at the end of
this section. We will now simply push forward with the light-cone gauge and then see that
it leads to severe problems.

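Let us now denote the gluon propagators by

$$\frac{-i\, N^{\mu\nu}(k)}{k^2 + i\epsilon}. \tag{4.25}$$

Then in the light-cone gauge $n \cdot A = 0$ we have

$$N^{\mu\nu}(k) = g^{\mu\nu} - \frac{n^\mu k^\nu + n^\nu k^\mu}{n \cdot k}. \tag{4.26}$$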
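We shall write $N^{\mu\nu}(k)$ as

$$N^{\mu\nu}(k) = \overset{\rightharpoonup}{G}{}^{\mu\nu}(k) - \overset{\leftharpoonup}{K}{}^{\mu\nu}(k), \tag{4.27}$$

where

$$\overset{\rightharpoonup}{G}{}^{\mu\nu}(k) = g^{\mu\nu} - \frac{n^\mu k^\nu}{k \cdot n} \tag{4.28}$$

$$\overset{\leftharpoonup}{K}{}^{\mu\nu}(k) = \frac{k^\mu n^\nu}{k \cdot n}. \tag{4.29}$$

Our notation here is inspired by the so-called K-G decomposition introduced by Grammer
and Yennie [106]. The directions of the harpoons indicate whether it is the left or the right
Lorentz index that is carried by the momentum $k$; $\overset{\rightharpoonup}{G}{}^{\mu\nu}(k)$ (and $\overset{\rightharpoonup}{K}{}^{\mu\nu}$) contains $k^\nu$, while
$\overset{\leftharpoonup}{K}{}^{\mu\nu}(k)$ (and $\overset{\leftharpoonup}{G}{}^{\mu\nu}$) contains $k^\mu$. Notice that the standard Grammer-Yennie decomposition
which is applied to the Feynman gauge propagators is in this notation given by

$$N^{\mu\nu}_{\rm Feyn} = g^{\mu\nu} = \overset{\leftharpoonup}{G}{}^{\mu\nu}(k) + \overset{\leftharpoonup}{K}{}^{\mu\nu}(k) = \left(g^{\mu\nu} - \frac{k^\mu n^\nu}{k \cdot n}\right) + \frac{k^\mu n^\nu}{k \cdot n}. \tag{4.30}$$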

The K-G decomposition is important in proving factorization in the hard scattering domain
since Ward identities can be applied to the K terms which are the dominant contributions.
Remember from the analysis in section 3.1 that there can be arbitrarily many longitudinally
polarized gluons exchanged between the hard and collinear regions, $H$ and $C$, and between
the collinear and the soft regions, $C$ and $S$. These gluons precisely correspond to the K
terms. If we choose $n$ such that $n \cdot A = A^+$, then for the G terms we have

$$\overset{\rightharpoonup}{G}{}^{-+}(k) = g^{-+} - \frac{n^- k^+}{k \cdot n} = 0, \qquad \overset{\leftharpoonup}{G}{}^{+-}(k) = g^{+-} - \frac{k^+ n^-}{k \cdot n} = 0, \tag{4.31}$$

while for the K terms

$$\overset{\rightharpoonup}{K}{}^{-+}(k) = \frac{n^- k^+}{k \cdot n} = 1, \qquad \overset{\leftharpoonup}{K}{}^{+-}(k) = \frac{k^+ n^-}{k \cdot n} = 1. \tag{4.32}$$

For the dominant polarization $N^{+-}$ we therefore see that only the K terms contribute.
The key step to proving factorization is then to repeatedly apply the Ward identities on
the K terms.
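The vanishing of the G terms and the unit value of the K terms for the longitudinal components can be checked directly. The sketch below works only with the $+$ and $-$ components and assumes the conventions $g^{+-} = g^{-+} = 1$, $g^{++} = g^{--} = 0$ and a gauge vector with $n^- = 1$, $n^+ = 0$, so that $n \cdot k = k^+$. The labels "right" and "left" (for the form carrying the momentum on the right or on the left Lorentz index) and all function names are assumptions made for this illustration.

```python
def metric(a, b):
    """g^{ab} restricted to the light-cone indices '+' and '-'."""
    return 1.0 if {a, b} == {"+", "-"} else 0.0


def n_dot(n, k):
    """n.k = n+ k- + n- k+ (transverse parts of n vanish)."""
    return n["+"] * k["-"] + n["-"] * k["+"]


def K_right(a, b, k, n):
    """K term with the momentum on the right index: n^a k^b / (n.k)."""
    return n[a] * k[b] / n_dot(n, k)


def K_left(a, b, k, n):
    """K term with the momentum on the left index: k^a n^b / (n.k)."""
    return k[a] * n[b] / n_dot(n, k)


def G_right(a, b, k, n):
    return metric(a, b) - K_right(a, b, k, n)


def G_left(a, b, k, n):
    return metric(a, b) - K_left(a, b, k, n)


if __name__ == "__main__":
    n = {"+": 0.0, "-": 1.0}   # n.A = A+ gauge (assumed convention)
    k = {"+": 3.0, "-": 0.5}   # arbitrary test momentum
    print("G_right^{-+} =", G_right("-", "+", k, n), " K_right^{-+} =", K_right("-", "+", k, n))
    print("G_left^{+-}  =", G_left("+", "-", k, n), " K_left^{+-}  =", K_left("+", "-", k, n))
```

The result is independent of the test momentum, since only the ratio $k^+/(n \cdot k) = 1$ enters.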

If, however, $k$ is dominated by its transverse component, then one can no longer neglect
the transverse G contributions to which the Ward identities do not apply. If for example
we have momentum which scales as $l - k$ in the above example, then

$$\left|\overset{\rightharpoonup}{G}{}^{-i}(l-k)\right| = \left|\frac{(l-k)^i}{(l-k)^+}\right| \gg 1, \qquad \left|\overset{\leftharpoonup}{K}{}^{i-}(l-k)\right| = \left|\frac{(l-k)^i}{(l-k)^+}\right| \gg 1. \tag{4.33}$$

This means that the transverse components cannot be neglected in favor of the $+-$
components. Moreover, even for the K terms, the application of the Ward identities leaves
non-factorizing remainder terms which are complicated. These can be neglected in the
collinear limit but not in the Glauber region. Therefore in all the higher order
corrections to figure 27 we must be able to make all necessary contour deformations so as to power
suppress these contributions.

Figure 29: The graphical representation of formula (4



In the axial gauge, the singular propagators must be regularized. A canonical regularization
is obtained by treating the singularities as principal values. Now, in [33] the
regularization is instead performed by choosing

$$ N^{\mu\nu}(k) = g^{\mu\nu} - \frac{n^\mu k^\nu}{k\cdot n - i\epsilon} - \frac{k^\mu n^\nu}{k\cdot n + i\epsilon}. \tag{4.34} $$

Here the momentum flows from $\mu$ towards $\nu$. The vector $n$ is now chosen so that $n\cdot A = A^-$.
There is, however, a fundamental problem with this gauge, and it shows up already for the
lowest order contribution in figure 29. It is related to the fact that the light-cone gauge does
not treat the hadrons symmetrically. We now demonstrate this problem by calculating the
contribution in figure 29.

The polarization vector of the produced gluon $l$ is chosen in [33] to satisfy $\epsilon^+(l) = 0$.
Since $l\cdot\epsilon(l) = 0$ one has $\epsilon^-(l) = l^i\epsilon^i/l^+$. The contribution from the process depicted in
figure 29 is given by

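$$ -g_s\,\epsilon^*_\beta(l)\,U^\rho_a(p_B,l-k)\left(\overset{\rightharpoonup}{G}_{\rho\gamma}(l-k) - \overset{\leftharpoonup}{K}_{\rho\gamma}(l-k)\right)V^{\gamma\alpha\beta}_{abc}\left(\overset{\leftharpoonup}{G}_{\alpha\delta}(k) - \overset{\rightharpoonup}{K}_{\alpha\delta}(k)\right)L^\delta_b(p_A,k) \tag{4.35} $$

where $V$ is the three-gluon vertex. The dominant component of the lower part is $L^+ \propto \sqrt{s}$,
while the dominant component of the upper part is $U^- \propto \sqrt{s}$. We notice that in the above
expression,

$$ U^\rho_a(p_B,l-k)\,\overset{\leftharpoonup}{K}_{\rho\gamma}(l-k) = L^\delta_b(p_A,k)\,\overset{\rightharpoonup}{K}_{\alpha\delta}(k) = 0 \tag{4.36} $$

by the use of the Ward identity. One is then left with

$$ -g_s\,\epsilon^*_\beta(l)\,U^\rho_a(p_B,l-k)\,\overset{\rightharpoonup}{G}_{\rho\gamma}(l-k)\,V^{\gamma\alpha\beta}_{abc}\,\overset{\leftharpoonup}{G}_{\alpha\delta}(k)\,L^\delta_b(p_A,k). \tag{4.37} $$

It is easily seen that the leading contributions are

$$ \overset{\rightharpoonup}{G}{}^{\delta\alpha}(k)\,L_{b,\delta}(p_A,k) \approx \overset{\rightharpoonup}{G}{}^{-\alpha}(k)\,L^+_b(p_A,k) = g^{-\alpha}L^+_b(p_A,k) \tag{4.38} $$

and

$$ U_{a,\rho}(p_B,l-k)\,\overset{\rightharpoonup}{G}{}^{\rho\gamma}(l-k) \approx U^-_a(p_B,l-k)\,\overset{\rightharpoonup}{G}{}^{+\gamma}(l-k). \tag{4.39} $$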
For the $\gamma$ index, the $\gamma = -$ component gives zero because of the gauge $A^- = 0$, while

$$ \left|\overset{\rightharpoonup}{G}{}^{++}(l-k)\right| = \frac{|l^+ - k^+|}{|l^- - k^-|} \ll \frac{|l_\perp - k_\perp|}{|l^- - k^-|} = \left|\overset{\rightharpoonup}{G}{}^{+i}(l-k)\right| \tag{4.40} $$

so the leading term comes from $\gamma = i$. From (4.38), taking also into account the contribution
from the complex conjugate amplitude, we see that we have for the lower part (we
neglect the color indices for the moment)

X -^ 
= ^TF)^ h^^^'^"iPA\{k-A{x) + k'A\x))\X,oni) 

{X,oni\{k-A{<d) + k'A\{)))\pA) 
= HtP\2 [ d'^xe'^'-'^ipAW A\x)\X, out) {X,out\k'A\0)\pA) 

(4.41) 

where in the second equality we used the fact that $A^- = 0$, while in the last equality we
used the Ward identity. For the upper part we instead have for the leading term

$$ \left(U^- G^{+i}\right)^\dagger\left(U^- G^{+i}\right) = \int d^4x\, e^{i(l-k)\cdot x}\,\langle p_B|A^i(x)A^i(0)|p_B\rangle. \tag{4.42} $$

In the gauge $A^- = 0$, the canonical definition of the TMD gluon distribution is (which
directly corresponds to the parton model definition (2.1))



' d^xe'''-''{pB\A'{x)A\0)\pi 



(2vr)4 
= j ^,^ j d'xe^^-^pB\F-\x)F-\Q)\pB). (4.43) 

This is for example also the case for the Weizsäcker-Williams distribution in the CGC,
with the only trivial difference being that in that case the pre-factor in the first line above
is taken to be $(k^-)^2/p_B^- = x k^-$ instead of $k^-$ (with $x = k^-/p_B^-$). We notice, however, that
the so-called dipole gluon distribution cannot really be fully consistent with (4.43). The
reason is that for the dipole gluon distribution, in the corresponding derivation one must
actually set $k^-$ to 0 (this is why the Wilson lines (2.9) are integrated from $-\infty$ to $+\infty$
in the longitudinal direction). One can therefore not multiply the definition with $k^-$ as
above, in order to obtain the canonical form (4.43). In that case one may instead multiply
the integral by $p_B^-$.

While it is straightforward to put (4.42) into the proper form, this is not so with the
lower component (4.41). Going now back to the evaluation of the graph in figure 29 we
thus have



gse*^il)U-il - k)L+ik)^-LIlig~-vf'^ 



-gsU-{l-k)L+{k)r 
-gsU-{l-h)L+{k)f 



abc 



l--k- 
1 



abc 



-k~' 

"Fl+ 



7Q/3 

-e*-{il-kl) - e\r-k'){k~-2i-)] 

{li-kl) + 2e*\l'-k' 



where $k^-$ has been neglected with respect to $l^-$. Using $2l^+l^- = l_\perp^2$ one then gets



2U-{l-k)Lt{k)r '- 

I' 



i\j2 



{e*n\li-ki)-e*\V-k')li). 



(4.44) 



(4.45) 



Squaring and summing over polarization and color indices, and integrating over k, we have 



9l 



m 



lE 



aa'bb'c -■- 



(2vr)' 



A{u-u;,^){LtL;,^)r'f 



(4.46) 



Now, to write a factorization formula for this result we have to untangle the color flow and
at the same time make the appropriate kinematical approximations. Using (4.21), we now
neglect $k^-$ with respect to $l^-$ in the U factors, and we set $k^+ = l^+$ in the L factors. The
$k^+$ ($k^-$) integral then acts only on the U (L) factors.

For obtaining the differential single inclusive cross section we project the diagonal color 
components in U and L, and we find that the result can be written as 



da 



1 1 4.glN,{2^Y 



dycPlj 



25 2(27r)3iV2-l ll J '^'^^ J J^T.^-^-^^^ 

J {2^Y^ " 



^r-l^kl 



. (4.47) 



k+=i+ 



We notice that up to this point the arguments have followed very closely those in section
3.1.3 that led to equation (3.18). However, as we discussed after equation (3.19), a
more careful treatment is needed since the integration over the momentum will include
contributions which are not strictly in the region where the above kinematics holds. What
we saw in equation (3.21) was that this could be treated by including subtractions in the
hard factor. In this case, we instead need subtractions in the last factor of (4.46). In fact
one must correctly treat the gluon production factor, the analog of the hard region, to all
orders and make sure it is gauge independent. This, however, does not affect the definition
of the gluon distribution.

Now, for the first bracket containing the upper blobs in (4.47) we have from (4.42) (we
keep the summation over the color indices implicit)

{l-f f dk+ 



1 
Pb 



dk+ 

(2^ 



u-u-Hi^-k^y 



fe-=o Pb 



{27rY 



d xe 



i(l—k)-x 



{pB\Alix)AimPB) 



dx+d^ 



■^-L il~x+-{lx-k±^-x±^ 



(27r)3p- 



{pb\F-\x+Q-x^)F-\0)\pb) 



(4.48) 
where we have chosen to include the factor $1/p_B^-$ into the definition. For the lower blobs,
however, we cannot get the standard formula because of the asymmetric gauge choice
$A^- = 0$. Using (4.41) we have (again keeping summation over color indices implicit)



1 

Pa 



dk- 



LTLt^ e 



E^ 



(27r)4 '' ^ 
dk~ k 



X 



p+J (27r)4(fe-)2 



k+=i+ 

ikx 



d''xe"'''{j)AWAl{x)\X,ont){X,ont\k'Al{Q)\pA) 



k+=i+ 

(4.49) 



This expression is clearly different from (4.43) or (4.48), and does not correspond to any
known distribution. One therefore does not obtain formula (4.8).

Let us now explain the other difficulty with the light-cone gauge that we mentioned
just above equation (4.35). As we have seen, in $A^- = 0$ gauge we have a problem with
the definition of the parton distribution for particle A which moves in the + direction.
Similarly if we chose $A^+ = 0$ gauge, then we will have a problem with the definition for
particle B. Let $k_{A,B}$ denote momenta attached between the collinear regions A, B and any
other region such as S or H. Where $k_A$ attaches to A, the collinear lines of A will force
$k_A^-$ to generally be small as in figure 28. If we now work in the $A^- = 0$ gauge it means we
additionally have the $1/k_A^-$ pole at the origin, and the combined poles from the propagator
and the collinear lines of A will then generally pinch $k_A^-$ at the origin. This, however, means
that the higher order terms cannot be deformed out to $k_A^- \sim Q$ to power suppress them
(terms for example such as $G^{+i}$ will be large). The gauge $A^- = 0$ therefore fails for the
gluons attaching to A. A similar argument for B shows that the $A^+ = 0$ gauge similarly is
not useful.

4.4 Non-light-like symmetric axial gauge 



To get a formula that looks like (4.8) one must instead choose a gauge that treats the two
hadrons symmetrically. This can for example be done by choosing the non-light-like axial
gauge $A^+ + A^- = 0$, i.e. the temporal gauge $A^0 = 0$. Using this gauge, one can again
eliminate the extra gluon couplings to the collinear regions. We will here use this gauge
to derive (4.8) and at the same time we will see what the definition of the TMD gluon
distribution is. However, in section 4.5 we will explain the general case, and demonstrate
the problem that is inherent in this axial gauge treatment as well.

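In the gauge $A^+ + A^- = 0$, the numerator of the gluon propagator is given by

$$ N^{\mu\nu}(k) = g^{\mu\nu} - \frac{n^\mu k^\nu + n^\nu k^\mu}{n\cdot k} + \frac{k^\mu k^\nu n^2}{(n\cdot k)^2} \tag{4.50} $$

where $n\cdot k = k^+ + k^-$ for any $k$. The contribution in figure 29 gives

$$ -g_s\,\epsilon^*_\beta(l)\,U^\rho_a(p_B,l-k)\,N_{\rho\gamma}(l-k)\,V^{\gamma\alpha\beta}_{abc}\,N_{\alpha\delta}(k)\,L^\delta_b(p_A,k). \tag{4.51} $$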
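
Two properties of the axial gauge numerator used in this section can be checked numerically: it is annihilated by the gauge vector, $n_\mu N^{\mu\nu}(k) = 0$, and for $n = (n^+, n^-) = (1, 1)$ its $+-$ component reduces to $2k^+k^-/(k^+ + k^-)^2$. The snippet below is a hedged sketch only; it assumes the standard form of the numerator with the double-pole term proportional to $n^2$, light-cone components ordered as (+, -, 1, 2), and hypothetical helper names.

```python
# Assumed conventions: components ordered (+, -, 1, 2), g^{+-} = g^{-+} = 1.
G_UP = [[0.0, 1.0, 0.0, 0.0],
        [1.0, 0.0, 0.0, 0.0],
        [0.0, 0.0, -1.0, 0.0],
        [0.0, 0.0, 0.0, -1.0]]


def dot(a, b):
    """Minkowski product in light-cone components."""
    return a[0] * b[1] + a[1] * b[0] - a[2] * b[2] - a[3] * b[3]


def lower(v):
    """Lower the index: v_+ = v^-, v_- = v^+, v_i = -v^i."""
    return [v[1], v[0], -v[2], -v[3]]


def axial_numerator(k, n):
    """N^{mu nu} = g - (n k + k n)/(n.k) + n^2 k k/(n.k)^2 (assumed standard form)."""
    kn, n2 = dot(k, n), dot(n, n)
    return [[G_UP[m][v] - (n[m] * k[v] + k[m] * n[v]) / kn + n2 * k[m] * k[v] / kn ** 2
             for v in range(4)] for m in range(4)]


n = [1.0, 1.0, 0.0, 0.0]    # temporal gauge, A^+ + A^- = 0
k = [2.5, 0.3, 0.7, -0.2]   # arbitrary test momentum
N = axial_numerator(k, n)

n_low = lower(n)
contracted = [sum(n_low[m] * N[m][v] for m in range(4)) for v in range(4)]
n_dot_k = dot(k, n)                      # equals k^+ + k^-
n_plus_minus = N[0][1]                   # N^{+-}
expected = 2 * k[0] * k[1] / (k[0] + k[1]) ** 2

print("n.k =", n_dot_k, " k^+ + k^- =", k[0] + k[1])
print("n_mu N^{mu nu} =", contracted)
print("N^{+-} =", n_plus_minus, " 2k+k-/(k+ + k-)^2 =", expected)
```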

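The last term proportional to $n^2$ in (4.50) then cancels in both propagators above when
the Ward identity is applied on U and L. One is then left with the same expression as in
(4.35) which again reduces to (4.37) when applying the Ward identity. As before we have
that

$$ \overset{\rightharpoonup}{G}{}^{\delta\alpha}(k)\,L_{b,\delta}(p_A,k) \approx \overset{\rightharpoonup}{G}{}^{-\alpha}(k)\,L^+_b(p_A,k) \tag{4.52} $$

and

$$ U_{a,\rho}(p_B,l-k)\,\overset{\rightharpoonup}{G}{}^{\rho\gamma}(l-k) \approx U^-_a(p_B,l-k)\,\overset{\rightharpoonup}{G}{}^{+\gamma}(l-k), \tag{4.53} $$

but in this case the leading G terms are different. We have

$$ \overset{\rightharpoonup}{G}{}^{++}(l-k) = -\frac{l^+ - k^+}{l^+ - k^+ + l^- - k^-} \sim \frac{m}{\sqrt{s}} \ll 1, \tag{4.54} $$
$$ \overset{\rightharpoonup}{G}{}^{+-}(l-k) = 1 - \frac{l^- - k^-}{l^+ - k^+ + l^- - k^-} = \frac{l^+ - k^+}{l^+ - k^+ + l^- - k^-} \sim \frac{m}{\sqrt{s}} \ll 1, \tag{4.55} $$
$$ \overset{\rightharpoonup}{G}{}^{+i}(l-k) = -\frac{l^i - k^i}{l^+ - k^+ + l^- - k^-} \sim \frac{m}{m} = 1, \tag{4.56} $$

and

$$ \overset{\rightharpoonup}{G}{}^{--}(k) = -\frac{k^-}{k^+ + k^-} \sim \frac{m}{\sqrt{s}} \ll 1, \tag{4.57} $$
$$ \overset{\rightharpoonup}{G}{}^{-+}(k) = 1 - \frac{k^+}{k^+ + k^-} = \frac{k^-}{k^+ + k^-} \sim \frac{m}{\sqrt{s}} \ll 1, \tag{4.58} $$
$$ \overset{\rightharpoonup}{G}{}^{-i}(k) = -\frac{k^i}{k^+ + k^-} \sim \frac{m}{m} = 1. \tag{4.59} $$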



The leading contributions are therefore the transverse components on both sides. Squaring
the contribution from figure 29 and summing over gluon polarizations one is then left
with (we neglect for simplicity the color factors since they are exactly the same as in the
light-cone gauge calculation above)

,J(C/-f/-t)(L+L-t)___2____^ X 

Y, [-3(*:+Ea + k-4)(kl - l^ ■ k^) + cik'll - 4l'(-kl + 2ix . k^)f . (4.60) 



We shall next choose the external polarization vector to satisfy $\epsilon^- = 0$, which means
that $\epsilon^+ = \epsilon^i l^i/l^-$. Then the first term in the sum above gives



-2^e'f{kl-l^-k^) 



(4.61) 



which is of the order of a transverse component multiplied by $k^-/l^- \sim m/\sqrt{s} \ll 1$ and can
therefore be neglected compared to the other transverse terms. One then gets



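$$ g_s^2\left(U^-U^{-\dagger}\right)\left(L^+L^{+\dagger}\right)\frac{1}{(l^+ - k^+ + l^- - k^-)^2}\,\frac{1}{(k^+ + k^-)^2}\; l_\perp^2\, k_\perp^2\,(l_\perp - k_\perp)^2. \tag{4.62} $$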
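Inserting now all pre-factors and color indices, we get for the gluon production cross section

$$ \frac{d\sigma}{dy\,d^2l_\perp} = \frac{1}{4}\,\frac{1}{2(2\pi)^3}\,\frac{(2\pi)^4 g_s^2 N_c}{N_c^2 - 1}\int d^4k\;\frac{U^-_a U^{-\dagger}_a (l_\perp - k_\perp)^2}{p_B^-(2\pi)^4}\;\frac{L^+_b L^{+\dagger}_b k_\perp^2}{p_A^+(2\pi)^4}\;\frac{l_\perp^2}{(l^+ - k^+ + l^- - k^-)^2 (k^+ + k^-)^2}. \tag{4.63} $$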



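To define the TMD gluon distribution we now notice that

$$ U^-U^{-\dagger}(l_\perp - k_\perp)^2 = (l^+ - k^+ + l^- - k^-)^2\int d^4x\, e^{i(l-k)\cdot x}\,\langle p_B|A^i(x)A^i(0)|p_B\rangle = 2\int d^4x\, e^{i(l-k)\cdot x}\,\langle p_B|F^{0i}(x)F^{0i}(0)|p_B\rangle, \tag{4.64} $$

where $F^{0i} = (1/\sqrt{2})(F^{+i} + F^{-i})$. Similarly

$$ L^+L^{+\dagger}k_\perp^2 = 2\int d^4x\, e^{ik\cdot x}\,\langle p_A|F^{0i}(x)F^{0i}(0)|p_A\rangle. \tag{4.65} $$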



To obtain the canonical forms of the two gluon distributions, we notice that we can drop the
$F^{-i}$ contribution in (4.65), since it gives rise to the contributions $k^-L^i$, $k^iL^-$ and $L^-$ which
are all power-suppressed. Therefore we might as well replace $F^{0i}$ by $F^{+i}/\sqrt{2}$. Similarly for
the expression in (4.64) we can replace $F^{0i}$ by $F^{-i}/\sqrt{2}$. To get the factorization formula,
one further needs to approximate $k^- = 0$ in the upper part, and $k^+ = l^+$ in the lower part.
Furthermore we apply the approximations from the kinematics in (4.19)-(4.24) in the
last factor in (4.63), which can then be written as (up to power-suppressed corrections)



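$$ \frac{l_\perp^2}{(l^+ - k^+ + l^- - k^-)^2 (k^+ + k^-)^2} \approx \frac{l_\perp^2}{(l^-)^2 (l^+)^2} = \frac{4}{l_\perp^2}. \tag{4.66} $$

Thus we find

$$ \frac{d\sigma}{dy\,d^2l_\perp} = \frac{2\pi^2\alpha_s}{C_F\, l_\perp^2}\int d^2k_\perp\left[\int\frac{dk^+}{(2\pi)^4}\frac{1}{p_B^-}\,U^-U^{-\dagger}(l_\perp - k_\perp)^2\right]\left[\int\frac{dk^-}{(2\pi)^4}\frac{1}{p_A^+}\,L^+L^{+\dagger}k_\perp^2\right] = \frac{2\pi^2\alpha_s}{C_F\, l_\perp^2}\int d^2k_\perp\, f_B(x_B, l_\perp - k_\perp)\, f_A(x_A, k_\perp), \tag{4.67} $$

with

$$ f_A(x_A, k_\perp) = \int\frac{dx^-\, d^2x_\perp}{(2\pi)^3 p_A^+}\; e^{i x_A p_A^+ x^- - i k_\perp\cdot x_\perp}\,\langle p_A|F^{+i}(0^+, x^-, x_\perp)F^{+i}(0)|p_A\rangle, \tag{4.68} $$

and

$$ f_B(x_B, l_\perp - k_\perp) = \int\frac{dx^+\, d^2x_\perp}{(2\pi)^3 p_B^-}\; e^{i x_B p_B^- x^+ - i (l-k)_\perp\cdot x_\perp}\,\langle p_B|F^{-i}(x^+, 0^-, x_\perp)F^{-i}(0)|p_B\rangle, \tag{4.69} $$

where $x_A = l^+/p_A^+$ and $x_B = l^-/p_B^-$.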
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4.4.1 The coefficient of the formula 



As for the coefficient in front of formula (4.67), we note that different values appear in the
literature. Let us denote the coefficient in (4.67) by

$$ C = \frac{2\pi^2\alpha_s}{C_F}. \tag{4.70} $$

In the papers [20,26,27,29,31,41] we instead find the formula (this is the value we used in
writing (4.8))

$$ C = \frac{2\alpha_s}{C_F} = \frac{4N_c\alpha_s}{N_c^2 - 1}, \tag{4.71} $$

while in [96] we find. 



and in [95] 



Similarly we find in [28] 



^-W?' '"'^ 



C = 27riVca,. (4.73) 



(2vr)«C^ (4 74) 

and in [30] 

"= ATI = SI ^"-^ 

where $K$ is a fit parameter which is quoted to be of the numerical value 1.5-2. We see
that the coefficients in (4.71), (4.72), (4.73), (4.74) and (4.75) are all different from each
other. It appears also that none agrees with the result above, equation (4.70). Our result
(4.70) on the other hand agrees with the result in [107] where it was indeed observed that
an extra factor $\pi$ for each TMD distribution must be included to agree with (4.71) above.

The numerical differences between the prefactors used in different papers are clearly
rather important. It should also further be noted that in the papers [16,30] the $k_\perp$ integration
is performed only up to $l_\perp$ while such a bound does not appear in the other papers.
Moreover in most of the phenomenological applications the coupling $\alpha_s$ is taken to run
with some scale which also differs from paper to paper.
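
To get a feeling for how much the quoted prefactors differ, one can simply evaluate them. The following is a rough numerical sketch, not a result from the paper: the values $\alpha_s = 0.2$ and $N_c = 3$ are illustrative assumptions, the function names are hypothetical, and the first coefficient is taken to be $\pi^2$ times (4.71), which is how the remark about one extra factor $\pi$ per TMD distribution is read here.

```python
import math


def casimir_fundamental(nc):
    """C_F = (N_c^2 - 1) / (2 N_c)."""
    return (nc ** 2 - 1) / (2 * nc)


def coeff_471(alpha_s, nc):
    """Coefficient (4.71): 2 alpha_s / C_F = 4 N_c alpha_s / (N_c^2 - 1)."""
    return 2 * alpha_s / casimir_fundamental(nc)


def coeff_with_pi_factors(alpha_s, nc):
    """Assumed reading of (4.70): one extra factor pi per TMD distribution."""
    return math.pi ** 2 * coeff_471(alpha_s, nc)


def coeff_473(alpha_s, nc):
    """Coefficient (4.73): 2 pi N_c alpha_s."""
    return 2 * math.pi * nc * alpha_s


alpha_s, nc = 0.2, 3   # illustrative values only
c_471 = coeff_471(alpha_s, nc)
c_pi = coeff_with_pi_factors(alpha_s, nc)
c_473 = coeff_473(alpha_s, nc)

print("(4.71):", c_471)
print("pi^2 x (4.71):", c_pi, " ratio:", c_pi / c_471)
print("(4.73):", c_473, " ratio to (4.71):", c_473 / c_471)
```

Under these assumptions the prefactors differ by roughly an order of magnitude, independently of the value chosen for $\alpha_s$.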

4.5 Higher order terms in axial gauge, and more complete view 

From the contribution in figure 29 we have thus seen that we can in the non-light-like axial
gauge, $A^+ + A^- = 0$, obtain the formula (4.8) where the TMD distributions are given by
(4.68) and (4.69). We notice that exactly the same gauge is used in the CCH formalism [66]
and in the GLR paper [96].

The question is of course what happens when we include higher order corrections to
figure 29. We will now in this section first prove that the axial gauge does indeed eliminate
to leading power the couplings to the collinear regions, and at the same time we will see
what kinematics is necessary for this result to hold. We will show that the kinematics is
actually opposed to the usual small-x kinematics. Thus for the higher order corrections to
be generally negligible we will need contour deformations to ensure the desired kinematics.
We shall then give an argument for why the needed contour deformations generally fail in
the axial gauge.

Assume now that we have a collinear region C which carries momentum lines that are
large in some direction $w_C$. For example this could be region $C_A$ which has large momentum
in the + direction. Let $\tilde w_C$ be the conjugate direction to $w_C$, such that $w_C\cdot\tilde w_C = 1$. The
large component of $C^\mu$ is then given by $\tilde w_C\cdot C$. We now choose the axial gauge $n\cdot A = 0$
where $n$ is not necessarily light-like. Let $V$ be any vector. We then have

$$ V\cdot C = V\cdot w_C\;\tilde w_C\cdot C + \text{p.s.c.} \tag{4.76} $$

where "p.s.c." as before stands for "power suppressed corrections". Now we let $V = n$, and
using that we are in $n\cdot A = 0$ gauge, we obtain

$$ 0 = n\cdot C = n\cdot w_C\;\tilde w_C\cdot C + \text{p.s.c.} \tag{4.77} $$

Assuming now that $n\cdot w_C \neq 0$, we can separately scale the gauge vector

$$ n \to \frac{n}{n\cdot w_C} \tag{4.78} $$

for each collinear region in the graph to get

$$ 0 = n\cdot C = \tilde w_C\cdot C + \text{p.s.c.} \tag{4.79} $$

Thus we conclude that the leading term vanishes in the axial gauge, and only power-suppressed
contributions remain. Notice that if $n\cdot w_C = 0$ then we cannot necessarily
conclude that the leading contribution is eliminated. It might also be that, depending on
the exact kinematics, several directions of $C^\mu$ simultaneously become important. In that
case the advantage of the axial gauge vanishes. Let us illustrate these points with some
examples.

Consider now a gluon $k$ coupling to region $C_A$, and denote $\tilde C_A^\mu = N^{\mu\nu}(k)C_{A,\nu}$. It is
actually then $\tilde C_A$ that corresponds to $C$ above (since $n\cdot\tilde C_A = 0$ but $n\cdot C_A \neq 0$). Assume
we are in the $A^- = 0$ gauge. Then

$$ \tilde C_A^+ \sim N^{+-}C_A^+ = \left(1 - \frac{k^-}{k^-}\right)C_A^+ = 0, \tag{4.80} $$
$$ \tilde C_A^i \sim N^{i-}C_A^+ = 0, \tag{4.81} $$
$$ \tilde C_A^- \sim N^{--}C_A^+ = 0. \tag{4.82} $$

Therefore only power suppressed contributions from A will remain (we could have also
immediately seen this from the fact that $n\cdot C_A = 0$ + p.s.c.). On the other hand if we
choose the gauge $A^+ = 0$ then

$$ \tilde C_A^+ \sim N^{+-}C_A^+ = \left(1 - \frac{k^+}{k^+}\right)C_A^+ = 0, \tag{4.83} $$
$$ \tilde C_A^i \sim N^{i-}C_A^+ = -\frac{k^i}{k^+}\,C_A^+, \tag{4.84} $$
$$ \tilde C_A^- \sim N^{--}C_A^+ = -\frac{2k^-}{k^+}\,C_A^+. \tag{4.85} $$

Here we see that $\tilde C_A^i$ and $\tilde C_A^-$ are suppressed only if $k^+$ is the dominant component of $k$.
If not, then in the higher order terms all contributions can be important and the situation
obviously gets complicated. The gauge $A^+ = 0$ is useful in DIS where the target hadron
has large $P^+$. In hadron-hadron collisions, however, as we have seen, the light-cone gauge
cannot be used. There is moreover the problem with rapidity divergences which appear in
TMD distributions via integrals like (3.27) (the light-cone distribution (4.42) for example
leads to divergences and is therefore ill-defined). These divergences become visible starting
from one loop calculations. Now assume we are instead in $A^+ + A^- = 0$ gauge. Then

61 ~ A'-'^l - (l - ^ + 1^) CI ^ II^Cl, (4.0) 

If for example k is collinear to Ca, then indeed the contributions are power suppressed. 
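
The behaviour of the components $N^{\mu -}(k)$, which multiply the large component $C_A^+$, can be compared for the three gauge choices discussed here. The snippet below is a minimal numerical sketch under stated assumptions: it uses the standard axial gauge numerator (the $n^2$ term vanishes automatically for light-like $n$), light-cone components ordered as (+, -, 1, 2), two illustrative momenta chosen by hand, and hypothetical helper names.

```python
# Assumed conventions: components ordered (+, -, 1, 2), g^{+-} = g^{-+} = 1.
G_UP = [[0.0, 1.0, 0.0, 0.0],
        [1.0, 0.0, 0.0, 0.0],
        [0.0, 0.0, -1.0, 0.0],
        [0.0, 0.0, 0.0, -1.0]]
MINUS = 1


def dot(a, b):
    """Minkowski product in light-cone components."""
    return a[0] * b[1] + a[1] * b[0] - a[2] * b[2] - a[3] * b[3]


def numerator(k, n):
    """Axial gauge numerator; the n^2 term is zero for light-like n."""
    kn, n2 = dot(k, n), dot(n, n)
    return [[G_UP[m][v] - (n[m] * k[v] + k[m] * n[v]) / kn + n2 * k[m] * k[v] / kn ** 2
             for v in range(4)] for m in range(4)]


def minus_column(k, n):
    """Components N^{mu -}(k) for mu = +, -, 1, 2."""
    N = numerator(k, n)
    return [N[m][MINUS] for m in range(4)]


GAUGES = {
    "A^- = 0": [1.0, 0.0, 0.0, 0.0],        # n.A = A^-
    "A^+ = 0": [0.0, 1.0, 0.0, 0.0],        # n.A = A^+
    "A^+ + A^- = 0": [1.0, 1.0, 0.0, 0.0],  # temporal gauge
}

k_collinear = [100.0, 0.01, 1.0, 0.0]   # k^+ dominates (collinear to A)
k_transverse = [0.02, 0.01, 1.0, 0.0]   # k_perp dominates

results = {}
for name, n in GAUGES.items():
    results[name] = (minus_column(k_collinear, n), minus_column(k_transverse, n))
    print(name)
    print("  collinear k:           ", results[name][0])
    print("  transverse-dominated k:", results[name][1])
```

For the collinear momentum all components are small in every gauge, while for the transverse-dominated momentum the transverse component becomes large in the $A^+ = 0$ and temporal gauges, in line with the discussion in the text.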

Thus for the axial gauge to be useful, the momenta emerging from $C_A$ ($C_B$) should
be collinear to $C_A$ ($C_B$). Actually none of the momentum components need to scale with
$\sqrt{s}$, but the dominant component should be $k^+$ (or $k^-$ for $C_B$). Remember indeed from
our classification scheme in section 4.1 that momenta which have no components scaling
with $\sqrt{s}$ but whose components along $C_A$ dominate are still classified as belonging to $C_A$.
If, however, we are in a region where for example $k_\perp$ dominates, then we see that we have
a large contribution from the transverse components. In that case we cannot neglect the
higher order corrections. This is why we must be able to always deform the contour into
the region where $k^+$ (or $k^-$ for $C_B$) is the large component.



The analysis above and in section 4.1 suggests a general picture like in figure 30. We
consider the case where the observed hadron, $p_C$, has some component scaling with $Q$,
the reason being that the scale $Q$ is needed to suppress the higher order corrections as
seen above. The regions in figure 30 are to be understood in the classification presented in
section 4.1. The momentum $Q$ is fixed and $Q/\sqrt{s} \to 0$ asymptotically. There are actually
further lines going out from the hard region which give undetected collinear regions but we
do not show them in figure 30 for simplicity. According to what we have just seen above,
in axial gauge we generally expect the contributions in figure 30 to be reduced to that of
figure 31. Here the extra collinear-to-hard gluons are missing, and the remaining gluons
coupling to H are transversely polarized (indicated by black squares).

Figure 30: Leading regions for single inclusive hadron production via gluon initiated jet in hadron-hadron
collisions. There is an additional collinear region associated with the produced hadron $p_C$.
There will generally also be additional collinear regions associated with unobserved jets; these are
not shown here for simplicity.

Figure 31: Single hadron production in axial gauge where the extra collinear-to-hard gluons can be
eliminated. The collinear regions then each couple to the hard region via a single transversely polarized
gluon, indicated by the black squares.



Note from figure 31 that the soft region still remains. Indeed the analysis above does
not directly apply to the soft region since we needed a scale $Q$ to suppress the higher order
terms. To simplify the expression completely then, one must be able to show that the soft
region can be eliminated or neglected.

Figure 32: Examples of graphs in axial gauge where soft gluons are exchanged between the
collinear regions. Each of these types of emission requires contour deformations in different directions
to stay out of the Glauber region.

In figure 32 we show examples of soft gluons exchanged between the different regions. In
the first graph (top left) the gluon $k$ attaches to the collinear-to-B gluon that goes into the
hard scattering. The momentum $k$ then runs in a loop from top to down, counterclockwise,
via H into A and back again. The line $k_B - k$ then gives a pole (taking all $k_\perp$ to be of
order $m$)



1 1 

_ ra m 

k ~ — ; ie ~ — - — ie. 



Q 



(4.89) 



Inside the lower blob A, k will run along the large momentum p^, and so there will be a 
typical pole of the type 



k- 
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V~s 
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le. 



(4.90) 



We thus see that these poles pinch the integration contour of $k^-$. It might also be that $k$
in the lower blob attaches to a line with plus component only of order $Q$ instead of $\sqrt{s}$,
but in any case we see that $k^-$ is at least forced to be small as $m^2/Q$. One can still save
the power counting arguments if $k^+$ can be deformed far out so that $k^+k^- \sim k_\perp^2$.

We must now, however, exactly specify how to treat the singularities of the axial gauge 
propagator (4.50). The canonical regularization of these singularities is given by the prin- 
cipal value prescription. The canonical regularization is useful because the corresponding 
generalized functions then obey elementary relations, such as ordinary differentiation, that 
are obeyed by the corresponding regular functions [72]. The use of principal value, however, 
also implies that one cannot deform the contours. The variable k ■ n must therefore remain 
on the real axis. 
As we have seen above, for the contributions in figure 32 we must deform in the first
graph (top left) $k^+$ but not $k^-$, in the second graph (top right) $k^-$ but not $k^+$, while in
the last graph (bottom) we must simultaneously deform $k_1^+$ and $k_2^-$ while keeping $k_1^-$ and
$k_2^+$ fixed. We then, however, see that these requirements are in contradiction with the fact
that we cannot deform on $k\cdot n$. For example, deforming in the first case $k^+$, i.e. letting
$k^+ \to k^+ + iC$ for some large $C \sim Q$, but keeping $k^-$ fixed implies that

$$ k\cdot n = k^+ + k^- \to (k^+ + iC) + k^- = k\cdot n + iC \tag{4.91} $$

which is not allowed. The required contour deformations therefore fail. We thus conclude
that the treatment in axial gauge is not complete.
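
The conflict between the required deformation and the principal value prescription can be made concrete with complex arithmetic. The following is only an illustrative sketch: the numerical values are arbitrary assumptions and `k_dot_n` is a hypothetical helper for the temporal gauge relation $k\cdot n = k^+ + k^-$.

```python
def k_dot_n(k_plus, k_minus):
    """k.n in the temporal gauge, where n.k = k^+ + k^-."""
    return k_plus + k_minus


k_plus, k_minus = 0.3 + 0j, 0.001 + 0j   # arbitrary real starting values
C = 50.0                                 # size of the deformation, of order Q

# Deform k^+ into the complex plane while keeping k^- fixed.
shifted = k_dot_n(k_plus + 1j * C, k_minus)
print("k.n before:", k_dot_n(k_plus, k_minus))
print("k.n after: ", shifted)

# Keeping k.n real would require shifting k^- by -iC at the same time,
# which contradicts the requirement that k^- stays fixed in this graph.
compensated = k_dot_n(k_plus + 1j * C, k_minus - 1j * C)
print("k.n with compensating shift of k^-:", compensated)
```

The deformed $k\cdot n$ acquires the imaginary part $C$, so it leaves the real axis on which the principal value is defined.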

One may also consider the possibility of using the so-called "planar gauge" introduced
in [105]. In this gauge, the gauge vector $n$ is non-light-like, so that $n^2 \neq 0$, but the last
term in the axial gauge propagator (4.50) is eliminated (by a clever choice of the gauge
fixing term in the Lagrangian). Moreover, as shown in [105], Faddeev-Popov ghosts are still
absent, just like in axial gauge. This gauge has thus all the advantages of the axial gauge,
and in addition is free from the double pole in the propagator. It is therefore certainly
much better behaved. However, the unphysical singularity $1/k\cdot n$ still remains and must
be treated via the principal value. Therefore the above arguments still apply to this gauge.
In [105] the authors argue that, since the propagator poles are unphysical and have to all
cancel at the end of the day, one might as well treat $1/k\cdot n$ as a regular function, excluding
this pole from loop integrals. The problem, however, is that one still needs to perform the
contour deformations to prove factorization, and in doing this the term $1/k\cdot n$ cannot be
neglected in the intermediate steps, even if the final result should be free from unphysical
poles.

It is of course possible that one chooses a regularization which is not principal value.
For example, we saw above that the choice in [33] for the light-cone gauge is given by
(4.34). In any case, however, it is very hard to see how exactly a systematic procedure is
developed that is capable of treating graphs of arbitrarily high order, as is required for the
full proof of factorization. As far as we are aware, this has never been done. We leave the
possibility open that a treatment in axial gauge might work out, but it is difficult to see
how this would be achieved.

4.6 The gluon distribution function 

We have systematically gone through single inclusive particle production at high energies,
and we have concentrated especially on the small-x factorization formula (4.8). In
this section we examine more closely the exact definition of the TMD gluon distribution.
We will moreover at the end of the section make some final comments on the validity of
factorization.

According to (4.11), the gluon distribution is a (modified) Fourier transform of the
dipole scattering amplitude in the adjoint representation. The expression (4.11) is appropriate
in a covariant gauge, and not in an axial gauge. In the canonical definition of the
parton distributions, the directions of the Wilson lines in (4.12) are taken opposite to the
hadron, i.e. for a hadron moving with momentum $p_A$ ($p_B$), the direction is taken as $n_B$
($n_A$), which is parallel to $p_B$ ($p_A$). To leading power we can also take the directions to be
$n_A + n_B$ for both hadrons, and the axial gauge $(n_A + n_B)\cdot A = 0$ then sets the Wilson
lines to unity. At first sight, however, this does not seem to be strictly correct because if in
(4.12) we set the Wilson lines to be unity then we find that (4.12) vanishes, $\mathcal{N} = 0$, which
obviously cannot be true. Part of the answer is that a fully gauge invariant definition of
(4.12) requires that we also insert transverse gauge links at infinity, and these are non-zero
in any axial gauge. However, to match the axial gauge expressions, (4.68) and (4.69), one
must also express the distribution (4.12) using the field tensors $F^{+i}$ and $F^{-i}$. Let us now
see how this can be done.

It is in fact a fundamental property of all gluon distributions that the field tensors
$F^{\mu\nu}$ appear in the definitions. The underlying reason for this comes from the elementary
parton model definition (2.1). As the QCD definitions are appropriate modifications and
generalizations of the parton model result, it is then natural that the field tensors appear
in the definitions of the integrated and TMD gluon distributions [1]. This is also the case in
the construction scheme for the generalized TMD distributions given in [108,109]. It should
therefore also be possible to write the dipole distribution (4.11) using the field tensors, if
it indeed is a TMD gluon distribution as claimed.

Consider the lowest order contribution from (4.12) where we insert a set of outgoing
states $|X, \mathrm{out}\rangle$ between the Wilson lines and then expand each Wilson line to first order
in $g_s$. We will assume that the averaging in (4.12) is given by an ordinary expectation
value between momentum eigenstates of the hadron, but we are not actually sure whether
this is consistent with the formalism from which (4.12) is supposed to arise. Nevertheless,
without this assumption we cannot make any real comparison. We also neglect for the
moment the regulator $y$ in (4.11) and (4.12). The first order expansion of (4.12) in (4.11)
for a hadron with momentum $p_A$ gives



f^Hk, 



Nr. 



E 



g'sNc 



X <^ 



1 



(27r)4a 



dx 



-kj 



dy~ 



lj)A\A+{x-,Xi_)\X, out)(A:, ovit\A+{y-,y^)\pA) 
{pa\pa) 



(4.92) 



The argument to convert $k^iA^+$ into $F^{+i}$ can now be made as follows. In the power
counting of the contributions from the region collinear to $p_A$, the largest contribution
arises from the + component as we have seen in sections 3.1.2 and 4.1. In the $N$ gluon
exchange term, the biggest contribution therefore arises from the terms where we pick up
the contribution $A^+$ for all the $N$ collinear-to-$p_A$ gluons. For every contribution where
we change one of the gluon polarizations from the longitudinal index + to a transverse
index $i$, we lose one power of $\sqrt{s}$. Thus one can let






(4.93) 



since the correction produces a power suppressed term. It is important to notice that this
exchange is not permissible in the hard scattering factorization. From the power counting
in section 3.1.2 we actually see that $k^iA^+ \sim mQ$ and $k^+A^i \sim Qm$ for a collinear-to-A
gluon $k$. In the small-x case, however, $k^+ \ll \sqrt{s}$, so that $k^+A^i \ll \sqrt{s}\,m \sim k^iA^+$.

For the lowest order term in (4.92) this is enough to convert each $k^iA^+$ into $F^{+i}$ since
the commutator in $F^{+i}$ contributes at higher order. Removing the sum over the states X,
one can then rewrite (4.92) as



/.?'(tx) "' '■"■ 



dx d x±_ 
2TTasN^-lJ (27r)32p+' 
N^ f dx~d^x±_ 



-ik_i_-x_i_ 



(p^|F+*(0+x-xx)F+^(0)|pa) 



m-i 



ik±-x± 



(2^)M 



{pa\F+\0+,x-,x^)F+\0)\pa). (4.94) 



In the dipole model from which (4.12) arises, the large $N_c$ limit is employed which means
that the coefficient $N_c^2/(N_c^2 - 1)$ is set to unity. The result (4.94) then very strongly
resembles ( 4.( 



We note, however, that in (4.94), there is no $x$ dependence as in (4.68). This is
a characteristic of the dipole formalism where the longitudinal component of the total
momentum coupling to the collinear region is neglected. The rapidity dependence of the
dipole distribution therefore purely arises from the rapidity cut-off. In (4.68), the rapidity
cut-off is not yet included, and the $x_A$ variable which is the longitudinal momentum fraction
of the gluon $k$ in figure 29 clearly does not play the role of a rapidity cut-off. This is also
one of the reasons why the dipole distribution ( 4.11 ) or ( ^.10 ) cannot be directly related to 
the integrated distribution as in (4.10), since the meanings of the longitudinal variables in
(4.10) are completely different on the right and the left hand sides. Despite this, however,
the relation (4.10) is still widely advocated in the small-x literature.

When all the gluons coupling to the collinear region contribute with their longitudinal
polarizations, however, there must be certain cancellations due to the Ward identities. In
Feynman gauge the easiest way to see this is to use the K-G decomposition (4.30). Ward
identities apply to the K terms, and these correspond to the longitudinally polarized
gluons. For the region collinear to $p_A$, we choose the vector $n$ in the K-G decomposition
(4.30) to be in the opposite direction to $p_A$, i.e. $n = n_B$ (and the other way around for
the B terms). Then as we saw in (4.31) and (4.32), the longitudinal components vanish for
the G terms while for the K terms we get unity. The largest contribution therefore arises
from the terms where we only pick up the K terms. Ward identities, however, imply that
part of this largest contribution cancels, leaving behind a remainder term which is of the
same order as the contributions where one gluon contributes as $G^{i-}$, while all the other
terms contribute via the K terms [1,71]. It is then the combination of the $G^{i-}$ term and
the remainder term from the Ward identity cancellations that gives rise to the field tensor
term $F^{+i}$ (including the commutator term) while the sum over all the K terms gives
the Wilson lines. We explain this in the context of the small-x calculations in [42] where
we derive the TMD gluon distribution that looks like (2.5). That is, a gluon distribution
including the $F^{+i}$ factors is naturally constructed.

Let us now extend the above analysis to all orders. In [108,109] a construction scheme
of TMD parton distributions was proposed. The proposed scheme is a method of converting
the collinear-to-hard gluons to Wilson lines, thus giving the "unsubtracted" TMD parton
distributions. We now apply the scheme to the present process.

Figure 33: The elementary graph for the gluon production.

The scheme starts from studying the elementary "hard" graph for the process under
consideration, that is figure 33. Of course here this graph does not involve any hard
momenta, but that does not really affect the structure of the Wilson lines which parametrize
the non-perturbative structure. According to [108,109] then, the contribution from the
process in figure 33 to the TMD gluon distribution of the lower particle (with momentum
$p_A$) is



(+)^ 



^{-)^ 



F,,{x) Fkio) ir^ ir " ' {w'^')cAw's') 



(4.95) 



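where

$$ W_B^{(\pm)} = W_B(0;\pm\infty^-,0_\perp)\, W_\perp(\pm\infty^-,0_\perp;\pm\infty^-,x_\perp)\, W_B(\pm\infty^-,x_\perp;x^-,x_\perp) \tag{4.96} $$

and

$$ W_B(x;y) = P\exp\left(-ig_s\int_x^y dz\; n_B\cdot A_a(z)\,T^a\right), \tag{4.97} $$

$$ W_\perp(x;y) = P\exp\left(-ig_s\int_x^y dz_\perp\cdot A_{\perp,a}(z)\,T^a\right). \tag{4.98} $$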
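
As a reminder of how the path-ordered exponential in (4.97) encodes successive gluon attachments, its expansion to second order in $g_s$ can be sketched as follows. The ordering rule used here (fields further along the path from $x$ to $y$ stand to the left) is an assumption on our part; the shorthand $A \equiv A_a T^a$ is introduced only for this illustration.

```latex
% Assumption: P places fields further along the path (from x towards y) to the left.
% Shorthand for this sketch only: A(z) \equiv A_a(z) T^a.
\begin{align}
W_B(x;y) = 1 \;-\; i g_s \int_x^y dz\; n_B\cdot A(z)
 \;+\; (-i g_s)^2 \int_x^y dz_1 \int_x^{z_1} dz_2\; n_B\cdot A(z_1)\, n_B\cdot A(z_2)
 \;+\; \mathcal{O}(g_s^3) .
\end{align}
```

Each power of $g_s$ corresponds to one additional collinear gluon attaching to the eikonal line, which is how summing the collinear-to-hard attachments of a given line produces a Wilson line.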



If we instead consider the TMD distribution of the upper hadron with momentum $p_B$ then
the longitudinal direction in (4.96) should be + instead of −, and in (4.97) $n_B \to n_A$.
Notice that in (4.95) the Wilson lines are in the adjoint representation as is clear from the



color subscripts. We now use T^^ = if^"^ for the adjoint representation to rewrite ( 4.95 ) as 



= Fa>Ax)Fac{0) {Wi+^),AWB )aa' 



B Icdyy^B laa' 



(4.99) 



where we have defined

$$ F_{ab} \equiv F_c\, T^c_{ab} . \tag{4.100} $$
From equation (4.95) one then finds the following contribution to the correlator in the
gluon distribution



{pa\ (F{x)wi^ 



(+)t 



F{0)wi, ) 

^(+)t 



\Pa) 



(-)i 



Tr{pA\F{x)W'^-" ^ FiO)W'^ >\pa). 



(4.101) 



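The trace is taken with respect to the adjoint representation with the field tensor defined
as in (4.100). The (unsubtracted) gluon distribution function is then given by

$$ f_A(x_A,k_\perp) = \int \frac{dx^-\, d^2x_\perp}{(2\pi)^3\, p_A^+}\; e^{i x_A p_A^+ x^- - i k_\perp\cdot x_\perp}\; \mathrm{Tr}\,\langle p_A|\, F^{+i}(0^+,x^-,x_\perp)\, W_B^{(+)\dagger}\, F^{+i}(0)\, W_B^{(-)}\, |p_A\rangle . \tag{4.102} $$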



Actually, note that in the canonical definitions (2T) and ( [4. 431) we would instead of 1/pa 
insert the factor $1/k^+ = 1/(x_A p_A^+)$. The reason we choose $1/p_A^+$ here is that we will
connect the above distribution with that of the dipole result (4.11), and remember from above
that the dipole result cannot be obtained if we have the factor $1/k^+$ (see also the remarks
just below).

Strictly speaking (4.102) involves only the bare fields. Remember from section 3.1.3
that the gluon distribution has to be renormalized as in equation (3.25). The soft region
must also be properly subtracted to cancel the rapidity divergences in (4.102). A similar
definition is easily obtained for the gluon distribution associated with $p_B$:

$$ f_B(x_B,k_\perp) = \int \frac{dx^+\, d^2x_\perp}{(2\pi)^3\, p_B^-}\; e^{i x_B p_B^- x^+ - i k_\perp\cdot x_\perp}\; \mathrm{Tr}\,\langle p_B|\, F^{-i}(x^+,0^-,x_\perp)\, W_A^{(+)\dagger}\, F^{-i}(0)\, W_A^{(-)}\, |p_B\rangle . \tag{4.103} $$



Exchanging to leading order the Wilson line directions to $n_A + n_B$ in both cases and
applying the axial gauge $(n_A + n_B)\cdot A = 0$ we then obtain (4.68) and (4.69) respectively.
There is an additional factor $N_c$ arising from the color traces in (4.102) and (4.103) (exactly
as in (4.94)). Thus we can see (4.102) and (4.103) as possible generalizations of (4.68) and
(4.69) to arbitrary gauge.
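
The two statements in the preceding paragraph can be sketched in two lines. We assume here that all longitudinal Wilson lines have been redirected along $n_A + n_B$, that the transverse links at infinity can be ignored, and that the adjoint generators are normalized as $\mathrm{Tr}(T^a T^b) = N_c\,\delta^{ab}$; the path parameter $\lambda$ is a label introduced only for this illustration.

```latex
% Assumptions: Wilson lines along n_A + n_B; transverse links at infinity dropped;
% adjoint normalization Tr(T^a T^b) = N_c \delta^{ab}.
\begin{align}
(n_A+n_B)\cdot A = 0 \;&\Longrightarrow\;
 P\exp\Big(-i g_s \int d\lambda\; (n_A+n_B)\cdot A_a\, T^a\Big) = 1 , \\
\mathrm{Tr}\big[F^{+i}(x)\, F^{+i}(0)\big]
 &= F^{+i}_a(x)\, F^{+i}_b(0)\; \mathrm{Tr}\big(T^a T^b\big)
 = N_c\, F^{+i}_a(x)\, F^{+i}_a(0) .
\end{align}
```

In this gauge the gauge links in (4.102) drop out and the adjoint trace reduces to a sum over gluon colors, which is where the extra factor $N_c$ mentioned above comes from.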

The connection to the dipole formula (4.11) and (4.12) can now be made as follows.
We consider the transverse derivatives in (4.11) acting on the Wilson lines in (4.12). The
effect of the derivative can be written as (for the hadron $p_A$)

$$ \partial^i W(x_\perp) = -ig_s \int dx^-\; W_B(x;\infty^-,x_\perp)\, \partial^i A^+_a(x)\, T^a\, W_B(-\infty^-,x_\perp;x) \tag{4.104} $$



where as we recall $W$ is given by taking (2.9) with the adjoint color matrices, while
$W_B(x;\infty^-,x_\perp)$ and $W_B(-\infty^-,x_\perp;x)$ are given by (4.97). We can again use (4.93) since
the correction is power suppressed. One can also argue that the commutator of the field
tensor is subleading, since at a given order in $g_s$ it contains one factor $A^i$ which replaces
a factor $A^+$ from the Wilson line. In that case we could replace
$-\partial^i A^+_a(x)\,T^a \to F^{+i}_a T^a = F^{+i}$
in (4.104). This would imply that (4.11) contains the same structure as in (4.102), once we
also set $x_A = 0$ in (4.102), which as we remember from above is the standard approximation
in the dipole formalism.
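
The structure of (4.104) follows from the product rule applied to a discretized Wilson line. The sketch below assumes that $W(x_\perp)$ is an infinite line along $x^-$ at fixed $x_\perp$, split into $N$ steps of width $\epsilon$ with $x_N^- > \dots > x_1^-$; the link variables $U_n$ are notation introduced only for this illustration.

```latex
% Sketch: discretize the x^- integration; U_n is the link over the n-th step.
% Terms of order \epsilon^2 are dropped before taking the limit.
\begin{align}
W(x_\perp) &= \lim_{N\to\infty} U_N U_{N-1}\cdots U_1 , \qquad
 U_n = \exp\big(-i g_s\, \epsilon\, A^+_a(x_n^-,x_\perp)\, T^a\big) , \\
\partial^i W(x_\perp) &= \lim_{N\to\infty} \sum_{n=1}^{N}
 U_N\cdots U_{n+1}\, \big(-i g_s\, \epsilon\, \partial^i A^+_a(x_n^-,x_\perp)\, T^a\big)\, U_{n-1}\cdots U_1 \\
 &= -i g_s \int dx^-\; W_B(x;\infty^-,x_\perp)\, \partial^i A^+_a(x)\, T^a\, W_B(-\infty^-,x_\perp;x) .
\end{align}
```

The derivative thus inserts a single factor $\partial^i A^+$ at every point along the line, dressed on both sides by the remaining segments of the Wilson line.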

Thus as we have seen, in a sense the formula (4.11) together with (4.12) contains the
contributions from the gluon field tensors as in (4.102). We motivated this by the power
counting arguments, but a word of caution is in order here. We have mentioned above
that the K terms in the K-G decomposition are subject to certain cancellations from the
Ward identities. This actually implies that terms containing one factor of $A^i$ on each side
of the cut become leading. As explained above, these arise from the $G^{i-}$ terms. Thus
the transverse components in $F^{+i}$, including the commutator, may not be automatically
dropped. The expression in (4.102) is therefore more correct than (4.11), assuming of
course that factorization holds. If not, then neither expression needs to be correct. Let us
therefore now finish our analysis with a discussion on the validity of factorization.

What we have thus seen is that (4.11) and (4.12) can be related to the distribution,
(4.102) or (4.103), constructed using the scheme of [108,109]. However, the scheme in
[108,109] by itself does not prove whether factorization holds or not. When a TMD parton
distribution associated with a given collinear region is being constructed, one considers
the attachments of the collinear-to-hard gluons to each line of the hard graph, and replaces
each set of connections by a Wilson line that correctly carries the color of the hard line.
Since TMD factorization is used for two particle production in the almost back-to-back
region, as in the examples of $e^+e^-$ annihilation and Drell-Yan production in section 3.1.5,
the relevant hard graphs are usually $2 \to 2$ partonic graphs, and one can then use these
basic graphs to construct the possible gauge links for a given collinear region. An extensive
list of possible gauge links is given in [109].

For proving factorization, however, one must consider all gluon attachments from the 
collinear regions to the hard graph simultaneously, as well as all possible soft attachments 
between the collinear regions. For example, in (4.95), following [108,109], the attachments
from the collinear regions Ca and Cb in figures ^, ^ or ^ are considered separately, 
and each is summed into the Wilson lines in (4.95). Considering all possible attachments,
however, as for example in the graphs in figure 32, it may very well be that the resulting
structure is more complicated than in (4.95) or that it is not even possible to identify any
gauge link contributions to the TMD distributions. At the same time, one must be able
to show that deformations out of the Glauber region are possible, or that the poles producing
the Glauber pinch cancel. Cancellation of the Glauber region has been demonstrated
explicitly in the case of Drell-Yan (Ch. 14, [1]), but difficulties may easily arise for the more
complicated processes studied in [108,109].

In reference [77], the breakdown of ordinary TMD factorization (i.e. the TMD
factorization that is relevant for the processes in section 3.1.5) was explicitly demonstrated
in di-hadron production in hadron-hadron collisions at the level of two-gluon exchange
between the hard part and the collinear part. We illustrate in figure 34 two examples of the
type of graphs considered in [77]. To distinguish the hard scattering we draw the hard
gluons as zig-zag lines, while the collinear-to-hard gluons are illustrated by curly lines. In
the elementary model considered in [77], the gluons are massive Abelian gluons, and the
active lines that enter the hard scattering are scalar "di-quarks" while the spectator lines
are fermions. The breakdown of ordinary factorization is then understood as being due
to the attachments of the collinear gluons from the lower hadron lines to the upper active
"quark" line, which is of course color connected to the upper hadron. The collinear-to-$p_A$
gluons in figure 34 which couple to the upper active lines of the hard part are precisely
the gluons that in the scheme of [108,109] give rise to the gauge links of the generalized
TMD distributions. The construction in (4.95) therefore contains these contributions. We
illustrate these in the single gluon production case in figure 35.

Figure 34: Production of two hadrons in an elementary model considered in [77]. We indicate the
hard scattering by the exchange of the zig-zag lines. The additional gluon contributions correspond
to breakdown of ordinary factorization.

Figure 35: Examples of the type of graphs that are taken into account in the construction scheme
of equation (4.95). The solid lines indicate the spectator parts of each hadron.

As discussed above, however, for a complete proof of factorization one must also
consider the simultaneous gluon couplings between the upper hadron and the hard part. This
was considered in reference [78], which calculated, in a slightly different model than [77], the
type of graphs shown in figure 36 (the zig-zag lines for example correspond to a massive
color singlet scalar boson). These graphs have an entangled color structure which makes it
impossible to factorize the color flows even in the scheme of [108,109]. The examples shown
in figure 36 then break factorization for the Double Spin Asymmetry (DSA), while in the
specific model considered the contributions from figure 36 to the unpolarized cross section
cancel. Breakdown of factorization for the unpolarized cross section instead appears for
graphs where three additional gluons are exchanged, with at least one gluon coupling to
each hadron. We illustrate this in figure 37.

Figure 36: Examples of the class of graphs considered in [78] that lead to the breakdown of TMD
factorization for DSA. We indicate the hard scattering by the exchange of the zig-zag lines.

Figure 37: Examples of graphs where TMD factorization is broken for the unpolarized cross
section. We indicate the hard scattering by the exchange of the zig-zag lines.

Figure 38: Examples of the type of graphs that may go beyond the construction scheme of
equation (4.95) in QCD. The solid lines indicate the spectator parts of each hadron.



What this shows us in the case of gluon production at small-x is that to answer the
question of factorization one needs to consider graphs like those in figure 38. These graphs
have non-trivial color flows that seemingly do not factorize into a color singlet factor
associated with each collinear region. In that case one must demonstrate explicitly that such
contributions cancel. Given, however, that they do not even cancel in the simple models
considered in [77,78], it seems rather difficult to see how they would in full QCD. Indeed we
note that the results from [78] have further been systematized in [110], where simultaneous
couplings to different parts are considered, generalizing the scheme in [108,109]. The
difficulties with the color entangled contributions are there clearly demonstrated.

Figure 39: Single hadron production where the second jet emerging from the hadron is integrated
over. Arbitrarily many gluons can be exchanged between each collinear region and the hard region,
as indicated by the dots. We do not show the soft region.



We mentioned earlier that the gluon production in figures |2^, ^ |3^, ^ and |38| corre- 
sponds to the case of soft particle production, illustrated in figure ^. To instead consider 
hard gluon (or rather hadron) production with large transverse momentum, so that a scale 
Q is present which can be used to suppress transverse polarizations, we need to take into 
account that the hard part contains additional jets. It can be shown that the case where 
more than two jets emerge from the hard region is suppressed in the almost back-to-back 
region [1]. We then consider the case where two gluon jets emerge from the hard region, 
and where only one of them contains the detected hadron. We illustrate this case in figure
39.



The case in figure 39 is equivalent to taking di-hadron production and then integrating over
one of the hadron momenta. The $2 \to 2$ hard scattering is now more intricate, and the
scheme of [108,109] becomes rather complicated, as can be seen from table 8 in reference
[109]. More importantly, however, the results in [77,78,110] become highly relevant and
show us that generally factorization is broken in di-hadron production. Cancellation of
the factorization breaking terms occurs for the integrated distribution, but not if we merely
integrate over the momentum of one of the detected final state particles. In fact this can be
seen in [110], where simplifications occur only when one integrates over all momenta except
for a single hadron. Even in that case, however, the simplification only occurs for contributions
that are termed "tree-level". It may of course be that the color structures simplify in the
strict large $N_c$ limit where $N_c \to \infty$. The factorization breaking graphs studied in [78], see
figure 36, are for example non-leading in $N_c$. Their effect on the production cross section
may still, however, be important if there is no kinematical suppression.



Finally we note that in more general processes like that in figure 39 there is also the soft
factor, which will now be more complicated than in standard TMD factorization. Assuming
that factorization holds, according to [109] the unsubtracted TMD gluon distribution is a
highly complicated function containing many different Wilson lines. Each light-like Wilson
line produces rapidity divergences that must be regulated. In addition to the rapidity
divergences there appear divergences related to the self energy corrections of the Wilson
lines. All these divergences are regulated by subtracting the soft factor from the collinear
region, which leads to definitions like that in equation (3.36). In the case of the gauge link
structures that appear for figure 39 using the scheme of [109], however, we dare not even
ask how exactly all these issues would be dealt with. It appears to be an immensely difficult
task to obtain final definitions of the highly complicated TMD distributions which are free
from all divergences. Yet this would be extremely helpful for precise phenomenological
applications.



5. Summary 

Our main aim has been to provide a coherent analysis of TMD factorization and the TMD 
gluon distribution, especially as used in the small-x region, and to examine many important 
points that usually are not well explained or are overlooked in the literature. 

In section 3 we have given a unified analysis of the concept of factorization in different
formalisms: the hard scattering formalism (section 3.1), the BFKL formalism (section 3.2)
and the CGC formalism (section 3.3). We also analyzed in section 3.4 what we called hybrid
approaches, which combine collinear factorization with the use of TMD distributions.



The main point in section 3.1 has been to explain what exactly is meant by factorization
in the hard scattering case, and what approximations and methods are built into the
analysis. We have then compared these to the small-x treatments, which use somewhat
different methods. We emphasized in section 3.3.4 the difference between factorization,
which is constructed to be valid to leading power, and the leading logarithmic approximation
(LLA), which is based on the one-loop calculation. As we have explained, the former is of
much greater accuracy and generality, which is important to understand when comparing
the different treatments.



In section 3.4 we explained the idea behind the so-called factorization of mass
singularities that is built into the hybrid formalisms. Let us note here that it has been
demonstrated in [1] that for the simplest partonic reactions as relevant for DIS, the method
gives the same results as the hard scattering factorization for the massless limit of the hard
scattering coefficient. It is, however, not clear to us whether this still holds in the cases
studied in the hybrid formalisms, where one also includes TMD distributions and studies
proton-nucleus collisions. We also note that the CCH and CCFM formalisms essentially base
their underlying formulas on the same approach. The use of the method in these formalisms is
discussed in [42]. We have explained here why this procedure is physically misleading, and
caution should be taken before trying to move on to more complicated reactions.
In section 4 we have given an extensive analysis of single particle production in the
small-x region. We started by showing in section 4.1 that one can perform a power counting
analysis very much as in section 3.1.2 to identify the leading structure. This is crucial
for understanding when the higher order corrections can be neglected and how the asserted
formulas can be justified. The main factorization formula (4.8) has been extensively used in
phenomenological applications of small-x QCD, at both RHIC and the LHC. It is therefore
crucial to understand the physics behind it and the justifications given for its validity. We
noted that many treatments in the literature are based on the axial gauge, and we therefore
examined the application of the axial gauge in justifying the factorization formula (4.8).
We showed in section 4.3 why the light-cone gauge is inappropriate for the formulation,
while in section 4.4 we showed how one can obtain the standard factorization formula in a
symmetric axial gauge.



Then in section 4.5 we demonstrated the technical difficulties with the use of the axial
gauge and suggested that a more complete treatment be based instead on covariant gauge.
In section 4.6 we then discussed the gluon distribution that is associated with (4.8) and
how it could generally be constructed from Feynman graphs, and we examined the graphs
that are problematic for the full proof of factorization.

There have lately been many applications of TMD factorization in the small-x region,
in pp, pA and AA collisions. To fully prove factorization, however, one must show that
the graphs of the type we showed in section 4.6 cancel. In the case of pA collisions we
emphasize that the gluon couplings from the proton side cannot be neglected. In particular
it does not follow that one can automatically treat the proton using integrated parton
distributions and fragmentation functions. If the observed particle is at low $p_\perp$ then the
transverse momentum of the collinear region of the proton and the soft region cannot be
neglected outside of these regions, and as a consequence TMD distributions must be used
everywhere. A more complete factorization formula must then be constructed, taking into
account the difficulties outlined in sections 4.5 and 4.6.



Finally, a point which we did not discuss much here concerns the scattering coefficient in
the gluon production formula, equation (4.8). Note that this factor diverges as $l_\perp \to 0$.
This is in fact a sign that the standard treatment cannot be complete. One should provide
for the scattering factor a full definition that is valid to all orders, is gauge independent,
and which contains the necessary subtractions to remove all divergences. An example for the
scattering factor in heavy $q\bar{q}$ production is given in [89].